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Hormone -Hormone Receptor Complexes and Nucleic 
Acid Constructs and Their Use in Gene Therapy 

5 

Background of the Invention 

1. Object of the Invention 

The invention relates to the use of a nucleic acid construct 
io comprising at least one hormone responsive element and a transgene 
for preparing an agent for gene transfer. It further relates to 
particular nucleic acid constructs comprising at least one hormone 
responsive element and a transgene, wherein one of said at least one 
hormone responsive elements is not functionally linked to the 
15 transgene, vectors comprising such nucleic acid constructs and 
compositions of matter comprising such nucleic acid constructs 
wherein the hormone responsive elements of the constructs are 
coupled to a hormone-hormone receptor complex. The nucleic acid 
constructs, plasmids, and compositions of matter of the invention 
20 have applications in gene therapy, particularly in the treatment of 
human blood clotting disorders, such as hemophilia. They may also be 
used to up- or down-regulate target genes and for the delivery of 
vaccines. 

25 2. Summary of the Related Art 

Gene therapy is a method that holds great promise for many 
diseases and disorders. In general, it involves the transfer of 
recombinant genes or transgenes into somatic cells to replace proteins 
with a genetic defect or to interfere with the pathological process of 
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an illness. In principle, gene therapy is a simple method. In practice, 
many disadvantages must still be overcome. 

Research in gene therapy has concentrated on ways to most 
effectively incorporate DNA into cells of a patient. Viral vectors are 
5 currently the widely used vehicles in clinical gene therapy approaches. 
In terms of efficacy in gene expression, the viral delivery systems 
have major advantages over techniques using DNA-lipid formulations 
as delivery vehicles or over mechanical methods, such as the gene 
gun. Although there are a variety of viral systems tested for gene 

10 therapeutical strategies, retroviral vectors and adenoviral vectors are 
presently the most widely used vehicles (Salmons, B. and Gunzburg, 
W. H., Hum. Gene Ther., Vol. 4, 129, 1993; Kasahara, N. A., et a!., 
Science, Vol. 266, 1373, 1994; Ali, M., et al., Gene Ther., Vol. 1, 367, 
1994. ). Still, these systems have major disadvantages, such as, 

15 potential viral contamination. Other safety concerns continue- to 
hamper the development of clinical application of gene therapy using 
these viral systems. For example, recombinant retroviruses have the 
disadvantage of random chromosomal integration, which may lead to 
activation of oncogenes or inactivation of tumor-suppressor genes. 

20 Also, repetitive use of recombinant adenoviruses has caused severe 
immunological problems (Elkon, K. B. et al., Proc. Natl. Acad. Sci. 
USA, Vol. 94, 9814, 1997). The humoral response resulted, in the 
production of antibodies to adenovirus proteins preventing subsequent 
infection. Immunosuppressive drugs may ameliorate these effects, but 

25 they place an additional burden on the patient (Dai, Y., et al., Proc. 
Natl. Acad. Sci. USA, Vol. 92, 1401, 1995). 

Yet another viral delivery system involves adenoassociated virus 
(AAV). The AAV requires coinfection with an unrelated helper virus. 
Although such recombinant AAV virions have proven useful for 

30 introducing several small gene sequences into host cells, gene 
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delivery systems based on those particles are limited by the relative 
small size of AAV particles. This feature greatly reduces the range of 
appropriate gene protocols. Moreover, the need to also use a helper 
virus adds a complicating factor to this delivery system (Muzyczka, N., 

5 Curr. Top. Microbiol. Immunol., Vol. 158, 97, 1992). 

Though safer, non-viral gene therapy approaches are also 
unsatisfactory. Problems with inefficient gene delivery or poor 
sustained expression are major drawbacks. Yet the methods available 
such as the direct injection of DNA into cellular compartments, or the 

10 application of mixtures of DNA with cationic lipids or polylysine 
allowing the transgene to cross the cell membrane more easily, have 
not overcome these hurdles (Feigner, P., et al., Proc. Natl. Acad. Sci. 
USA, Vol. 84, 7413, 1987; Behr, J. -P., Bioconjugate Chemistry, Vol. 5, 
382, 1994). 

is Introduction of naked DNA (polynucleotide) sequences (including 

antisense DNA) into vertebrates, is reported to be achieved by 
injection into tissues such as muscle, brain or skin or by introduction 
into the blood circulation (Wolff, J. A., et al., Science, Vol. 247, 1990; 
Lin, H., et al., Circulation, Vol. 82, 2217, 1990; Schwartz, B., et al., 

20 Gene Then, Vol. 3, 405, 1996). Also, a direct gene transfer into 
mammals has been reported for formulations of DNA encapsulated in 
liposomes and DNA entrapped in proteoliposomes containing receptor 
proteins. Although injected naked DNA leads to transgene expression, 
the efficiency is by far not comparable to viral-based DNA delivery 

25 systems. A limitation of the method of naked DNA injection is the fact 
that transgene expression is dose-dependent. The gene expression is 
saturable, and an increase in the amount of DNA injected leads to 
decreased protein production per plasmid. Thus, protein expression 
can dramatically decrease, if the amount of DNA injected is above a 

30 certain threshold. 
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Among the genetic disorders that the skilled artisan has sought 
to overcome using these prior art methods are those relating to blood 
clotting disorders, and in particular, hemophilia (Lozier, J. N. and 
Brinkhous, K. M., JAMA, Vol.271, 1994; Hoeben, R. C, Biologicals, 

5 Vol. 23, 27, 1995). For example, hemophilia A and B are X-linked, 
recessive bleeding disorders caused by deficiencies of clotting factors 
VIII and IX, respectively (Sadler, 3. E. et al., in: The Molecular Basis 
of Blood Diseases, 575, 1987). The incidence of hemophilia is about 1 
in 5,000 male births. Hemophiliacs suffer from excessive bleeding due 

io to the lack of clotting at the site of wounds. The inability to clot 
properly causes damage to joints and internal tissues as well as 
posing risks to the proper treatment of cuts. 

Treatment of hemophilia A is possible by the administration of 
the blood clotting factor VIII. Until recently, factor VIII preparations 

is had to be prepared by concentrating blood from donors, posing the 
risk of contamination by infectious agents, such as HIV and hepatitis. 
The gene for factor VIII has been cloned (e.g., Vehar et al., Nature 
Vol. 312, 337 1984) allowing for the production of a recombinant 
product. Although recombinant methods provide factor VIII of higher 

20 purity than blood concentrates, the exogenous supply of factor VIII to 
a patient still requires repeated doses throughout the lifetime of the 
patient, an inconvenient and expensive solution. 

Other forms of hemophilia include hemophilia B, caused by a 
defect in the gene coding for Factor IX. The gene therapy systems 

25 described above have been attempted for the treatment of hemophilia 
A and B with factors VIII and IX, respectively. (See e.g., WO 
94/29471). However, these systems have the disadvantages already 
discussed above. 

On the other hand, the classical model of the action of hormones 

30 is based on the concept of binding interaction of the hormone to an 
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intracellular receptor, located in the cytoplasm or the nucleus (Evans, 
R., Science, Vol. 240, 889, 1988). These intracellular receptors 
remain latent until exposed to their target hormone. When so 
exposed, the hormone receptor changes its conformation after the 

5 hormone is bound and translocates in the activated form into the cell 
nucleus where it binds as a dimer to hormone responsive elements in 
the promoter region of hormone-regulated genes (Beato, M., Cell, Vol. 
56, 335, 1989; O'Malley, B., et al., Biol. Reprod., Vol. 46, 163, 
1992). The hormone responsive elements are enhancer elements 

10 usually located in the 5 ' flanking region of the specific hormone- 
induced gene, i.e., are functionally linked to the specific hormone 
induced gene. DNA constructs comprising a hormone responsive 
element and a nucleic acid sequence encoding a protein of interest are 
disclosed in U.S. Pat. Nos. 5,688,677 and 5,580,722 and are taught to 

is be suitable for expression of the protein of interest. 

An example of such intracellular receptors is the steroid 

receptor. Steroid receptors belong to a superfamily of ligand- 

m 

dependent transcription factors characterized by a unique molecular 
structure. The centrally located highly conserved DNA-binding domain 

20 defines this superfamily. The second important and relatively invariant 
region is the COOH-terminal ligand-binding domain. An example of 
such a receptor is the progesterone receptor mediated by the steroid 
progesterone. At the progesterone receptor, progesterone acts as a 
natural agonist whereas it displays potent antimineralocorticoid 

25 properties both at the molecular and the systemic level. Besides 
classical effects on the uterus, antiepileptic, anxiolytic, hypnotic and 
anesthetic properties have been attributed to progesterone according 
to numerous studies. 

Methods have been proposed for the use of mutant hormone 

30 receptors, including mutant steroid receptors for gene therapy. For 
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example, such methods are disclosed in WO 93/23431, WO 98/18925, 
WO 96/40911. Moreover, WO 98/33903 discloses a genetic construct 
comprising a steroid responsive element from a tissue specific gene, a 
coding sequence, and an SV40 enhancer. 

5 

Brief Description of the Invention 

The object of the present invention is to overcome the 
disadvantages of the previous gene therapy delivery systems. It was 
found that a hormone-hormone receptor complex possesses the 

10 ability to drag a nucleic acid construct having one or more hormone 
responsive element(s) through the cell membrane into a cell. It was 
also found that if the construct comprises further functional sequences 
besides the hormone responsive elements (hereinafter "transgenes"), 
the functional sequences exert their function. The hormone responsive 

15 element may also enhance the expression of the transgene. Moreover, 
it was found that steroid hormones are very effective mediators for 
the transfer of nucleic acid constructs through the cell membranes 
into a cell. The present invention thus provides 

(1) the use of a nucleic acid construct comprising at least one 
20 hormone responsive element (hereinafter referred to as "HRE") and a 

transgene for preparing an agent for gene transfer (said at least one 
HRE being functionally linked to the transgene or not); 

(2) a preferred embodiment of (1) above, wherein the agent 
further comprises a hormone-hormone receptor complex; 

25 (3) a nucleic acid construct comprising at least one HRE and a 

transgene, wherein one of said at least one HREs is not functionally 
linked to the transgene; 

(4) a vector comprising the nucleic acid construct of (3) above; 
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(5) a transformed cell or transgenic organism comprising the 
nucleic acid construct as defined in (3) above or the vector as defined 
in (4) above; 

(6) a composition of matter comprising a nucleic acid construct 
5 comprising at least one HRE and a transgene as defined in (3) above 

and/or a vector as defined in (4) above, said at least one HRE being 
coupled to a hormone-hormone receptor complex; 

(7) a preferred embodiment of (6) above, wherein the 
transgene is a gene encoding a blood clotting factor; 

10 (8) a preferred embodiment of (7) above, wherein the blood 

clotting factor is factor IX; 

(9) a preferred embodiment of (7) above, wherein the blood 
clotting factor is factor VIII; 

(10) a pharmaceutical composition comprising the nucleic acid 
is construct as defined in (3) above and/or the composition of matter as 

defined in (6) to (9) above; 

(11) a method for preparing the composition of matter as 
defined in (6) above, which method comprises admixing the nucleic 
acid construct with the hormone receptor and the hormone; 

20 (12) a method for gene transfer which comprises administering 

the agent as defined in (1) and (2) or the composition of matter as 
defined in (6) to (9) above to an organism or to a cellular system; 

(13) a method for delivering into an organism or into a cellular 
system a nucleic acid encoding a transgene to be expressed in the 

25 cells of the organism or the cells of the cellular system, which method 
comprises administering an agent as defined in (1) above or 
composition of matter as defined in (6) to (9) above to the organism 
or to the cellular system so that the hormone in the composition 
interacts with the cell membrane and therewith enhances diffusion 
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and transport of the nucleic acid that is coupled to the hormone- 
hormone receptor complex across the membrane and into the cell; 

(14) a method of treating blood clotting disorders comprising 
administering a therapeutically effective amount of the composition of 

5 matter as defined in (7) above to an organism or to a cellular system; 

(15) a method of treating hemophilia B, comprising 
administering a therapeutically effective amount of the composition of 
matter as defined in (8) above to an organism or to a cellular system; 

(16) method of treating hemophilia A, comprising administering 
10 a therapeutically effective amount of the composition of matter as 

defined in (9) above to an organism or to a cellular system; 

(17) use of a steroid hormone for preparing an agent for gene 
transfer; and 

(18) a method for gene transfer which comprises administering 
15 a nucleic acid construct to an organism or to a cellular system, 

wherein the nucleic acid construct contains a transgene and is 
encapsulated in a steroid hormone. 

In a preferred embodiment of (1) to (16) above the hormone 
20 responsive element is a steroid responsive element (SRE), most 
preferably a progesterone responsive element (PRE). In embodiments 
(2) and (6) to (16) the receptor preferably is a steroid receptor, most 
preferably, a progesterone receptor. Similarly, the hormone is 
preferably a steroid, most preferably, progesterone. 
25 The present invention thus provides a delivery system for gene 

therapy that should overcome the prior art disadvantages. The 
presence of the hormone responsive element on the nucleic acid 
carrying a transgene encourages the binding of a hormone-hormone 
receptor complex. Thus, the present invention uses the activated 
30 hormone receptor as a link (or binding compound) between the 
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nucleic acid carrying the transgene and the hormone known to 
interact with the cell membrane. The generalknown biological activity 
mediated by the HREs is not the primary effect utilized in the present 
invention, but might be an additional effect when regulation of the 

5 transgene is desired. The general principle is depicted in Figure 1. The 
hormone responsive element is preferably present as a nucleic acid 
dimer sequence or nucleic acid multimer sequence. Even in an inverse 
orientation, the hormone responsive element will exert its proper 
function. The hormone-hormone receptor complex contains a 

10 hormone receptor that becomes activated after binding of its specific 
hormone. The hormone receptor in the activated state is able to 
recognize and bind to its specific hormone responsive element, which 
in the present invention is present within the nucleic acid comprising 
the desired transgene, e.g., a human blood-clotting factor. 

15 Vaccination is another aspect of the embodiment (12) defined 

above. Introducing a nucleic acid construct or composition of matter 
of the invention comprising a gene for an antigen or containing a viral 
sequence into a cell (DNA vaccines) using the method mentioned 
above may also provide a way to stimulate the cellular immune 

20 response. 

Brief Description of the Drawings 

Figure 1 shows the concept of gene transfer of the present invention 
25 (with HRE = hormone responsive element, HR = hormone receptor, H 

= hormone, blank circles = lipophilic matrix). 

Figure 2 is a diagram of the vector pTGFGl. 

Figure 3 is a diagram of the vector pTGFG5. 

Figure 4 is a diagram of the vector pTGFG20. 
30 Figure 5 is a diagram of the vector pTGFG33. 
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Figure 6 is a diagram of the vector pTGFG36. 
Figure 7 is a diagram of the vector pTGFG53. 
Figure 8 is a diagram of the vector pTGFG64. 

Figure 9 is the DNA sequence of vector pTGFG36 (SEQ ID NO: 1). 
5 Figure 10 shows the protein sequence of factor IX encoded by vector 
pTGFG36 (SEQ ID NO: 2). 

Figure 11 shows a GFP concentration curve for cell homogenates after 
transfection with pTGFG5 and pTGFG20, respectively. 
Figure 12 shows corresponding light (a and c) and fluorescent (b and 
10 d) micrographs of HeLa cells transfected with pTGFGS (a and b) and 
pTGFG20 (c and d), respectively. 

Figure 13 shows the amount of GFP expressed by utilizing the 
favoured vectors of the invention in a transfection experiment. 
Relative fluorescence units from mock and background can be clearly 
15 separated. 

Figure 14 shows the additive effect of human clotting factor IX on 
clotting activity of mouse blood. 

Figure 15: hPR (A-form) was expressed in insect cells and purified by 
cobalt 2 "*" affinity chromatography as described in Example 5. The final 

20 preparation (85pg protein) was separated on a denaturing 7,5% SDS- 
polyacrylamid gel, followed by staining with coomassie® R250 (lane A) 
or western blotting with hPR-specific staining (lane C). 
Lane B: Molecular mass standard. Arrows indicate the two highly 
enriched protein species (94 and 74 kDa) accessible to 

25 immunodetection. 

Figure 16: Domain structure of hPR-B (numbers on the top of the bar 
represent amino acid positions within the polypeptide sequence). 
Figure 17 shows the mean values of the difference in the clotting time 
of Example 9. 

30 Figure 18 shows the clotting time detected in Example 9. 
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Figure 19 shows the activity of human progesterone receptor as 
determined in Example 8. 

Figure 20: shows the amino acid sequence of the hPR B-Form. The 
start methionine 165 of the hPR A-Form is underlined (SEQ ID NO: 
5 18). 

Figure 21 shows the nucleic acid sequence of the mRNA coding for 
hPR. The reading frame for the hPR B-form starts at position 176, the 
reading frame for the hPR A-Form at position 668. The respective 
start codons ATG are underlined (SEQ ID NO: 19). The sequences of 
10 Figures 20 and 21 are taken from Genbank, accession number 
AF016381. 

Detailed Description of the Invention 

is 1. Definitions 

"Nucleic acid" means DNA, cDNA, mRNA, tRNA, rRNA. The 
nucleic acid may be linear or circular, double-stranded or single- 
stranded. 

"Nucleic acid construct" refers to a composite of nucleic acid 
20 elements in relation to one another. The nucleic acid elements of the 
construct may be incorporated into a vector in such an orientation 
that a desired gene may be transcribed, and if desired, a desired 
protein may be expressed. 

"Transgene" refers to a functional nucleic acid sequence which is 
25 transcriptionally active (with or without regulatory sequences). 
"Gene transfer" includes "gene therapy". 

"Hormone responsive element" (HRE) refers to regions of 
nucleic acids, and in particular, DNA, which regulate transcription of 
genes in response to hormone activation. HREs are typically about 10- 
30 40 nucleotides in length, and more usually, about 13-20 nucleotides in 
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length. As explained above, HREs become activated when a hormone 
:. binds to its corresponding intracellular receptor causing a 
conformational change, so that the receptor has increased affinity for 
the HRE and binds to it. The HRE, in turn, stimulates transcription. A 
5 "steroid responsive element" (SRE) is an HRE that regulates 
transcription of genes in response to steroid activation. A 
"progesterone responsive element" (PRE) is an HRE/SRE that 
regulates transcription of genes in response to progesterone 
activation. 

10 A "hormone receptor" refers to a receptor which binds to and is 

activated by a hormone. A "steroid receptor" refers to a receptor 
which binds to and is activated by a steroid hormone. A "progesterone 
receptor" is a receptor which binds to or is activated by the steroid 
hormone progesterone. 

is "Functionally linked" refers to configurations of the nucleic acid 

construct, where the HRE (or SRE/or PRE) is located within the 
construct so that it can stimulate transcription of the transgene. "Not 
functionally linked" refers to configurations where the HRE is so 
remotely located from the transgene that it cannot stimulate its 

20 transcription. 

"Gene" refers to DNA sequence encoding a polypeptide, 
optionally including leader and trailer sequences and introns and 
exons. 

"Vector" refers to any genetic construct, such as a plasmid, 
25 phage, cosmid, etc., which is capable of replication when associated 
with the proper control elements and which can transfer gene 
sequences between cells. The term includes cloning and expression 
vehicles. 

"Promoter" refers to a region of regulatory DNA sequences for 
30 the control of transcription of a gene to which RNA polymerase binds. 
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The promoter forms an initiation complex with RNA polymerase to 
initiate and drive transcription activity. "Enhancers" may activate the 
complex or "silencers" may inhibit the complex. A "tissue-specific 
promoter" is a promoter found in the DNA of tissue for transcription of 
5 genes expressed in this specific tissue. 

"Organism" refers to a multicellular living entity including 
vertebrates such as mammals (especially humans, cattle, rodents, 
dogs) and invertebrates. 

"Cellular system" includes cell cultures, e.g., primary cell 
10 cultures (especially those suitable for reimplantation), stem cells, 
blood cells, tissue samples and whole organs and immortalized cell 
cultures. 

"Therapeutically effective dose" of the products of the invention 
refers to a dose effective for treatment or prophylaxis, for example, a 

15 dose that yields effective treatment or reduction of the symptoms of 
hemophilia. It is also a dose that measurably activates expression of a 
target gene as determined by measurements of target protein levels, 
or a dose that is predictable to be effective for treatment or 
prophylaxis by extrapolating from in vitro or in vivo data. The 

20 determination of a therapeutically effective dose is within the purview 
of one skilled in the art. 

"Encodes" or "encoding" refers to a property of the nucleic acid 
sequence of being transcribed (in case of DNA) or translated (in the 
case of mRNA) into a polypeptide in vitro or in vivo when placed under 

25 the control of appropriate regulatory sequences. 

For the purposes of this application, "express", "expressing" or 
"expression" shall refer to transcription and translation of a gene 
encoding a protein. 

30 
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2. Detailed Description and Examples 

As stated above, an object of the present invention is to provide 
a new and improved delivery system for gene therapy. The invention 
thus provides nucleic acid constructs comprising at least one HRE and 
a transgene wherein one of said at least one HREs is not functionally 
linked to the transgene, and compositions of matter comprising such 
nucleic acid construct wherein said at least one HRE is coupled to a 
hormone-hormone receptor complex (embodiments (3) and (6) 
defined above). A preferred embodiment of the nucleic acid construct 
and of the composition of matter of the invention is one where the 
hormone responsive element is a steroid responsive element (SRE), 
and the receptor is a steroid receptor. Most preferably, the hormone 
responsive element is a progesterone responsive element (PRE), and 
the receptor is a progesterone receptor. 

Potential HREs for use in the present invention have been 
previously described. For example, GREs (Scheidereit, C, et al., 
Nature, Vol. 304, 749, 1983; von der Ahe, D., et al., Proc. Natl. Acad. 
Sci. USA, Vol. 83, 2817, 1986), EREs or PREs (Chambon, P., et al., 
Rec. Prog. Horm. Res., Vol., 40, 1, 1984; Klock, G., et al., Nature, 
Vol. 329, 734, 1987). As already stated above, the most preferred 
HRE for the invention is a PRE. Specifically, the preferred PRE is 
described in Example 1, i.e., is the double stranded DNA sequence 
comprised of SEQ ID NOs: 3 and 4. The nucleic acid for use in the 
invention comprises at least one hormone responsive element. 
Preferred is a nucleic acid comprising more than one HRE. For 
example, the nucleic acid may comprise three to ten, preferably three 
to five HREs. The most preferred embodiment is a nucleic acid 
comprising three to five PREs. 

Potential hormone receptors for use in the present invention 
are, for example, estrogen receptors, mineralocorticoid receptors, 
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glucocorticoid receptors, retinoic acid receptors, androgen, calcitriol, 
thyroid hormone or progesterone receptors and orphan receptors. 
Such receptors have been previously described. (Green, S., et al., 
Nature, Vol. 320, 134, 1986; Green, G. L.,et al., Science, Vol. 231, 

5 1150, 1986; Arriza, J. L., et al., Science, Vol. 237, 268, 1987; 
Hollenberg, S. M., et al., Nature, Vol. 318, 635, 1985; Petkovitch, M., 
et al., Nature, Vol. 330, 444, 1987; Giguere, V., et al., Nature , Vol. 
330, 624, 1987; Tilley, W., et al., Proc. Natl. Acad. Sci. USA, Vol. 86, 
327, 1989; Baker, A. R., et al., Proc. Natl. Acad. Sci. USA, Vol. 85, 

io 3294, 1988; Weinberger, C., et al., Nature, Vol. 324, 641, 1986; Sap, 
J., et al., Nature, Vol. 324, 635, 1086; Misrahi, M., et al., Biochem. 
Biophys. Res. Commun., Vol. 143, 740, 1987; Kastner, P., et al., Cell, 
Vol. 83, 859, 1995). These receptors may be from human or other 
mammalian sources, although human is preferred. Nucleotide and/or 

15 amino acid sequences of human steroid receptors are available in the 
GenBank: mineralocorticoid receptor: M16801; glucocorticoid receptor 
a: M10901; glucocorticoid receptor a 2 '- U01351; glucocorticoid 
receptor 6: M11050; retinoic acid receptor a: AF088888 (exon 1), 
AF088889 (exon 2), AF088890 (exon 3), AF088891 (exon 4), 

20 AF088892 (exon 5 and 6), AF088893 (exon 7), AF088894 (exon 8), 
AF088895 (exon 9 and complete cDNA); retinoic acid receptor y: 
M24857; androgen receptor: M27423 (exon 1), M27424 (exon 2), 
M27425 (exon 3), M27436 (exon 4), M27427 (exon 5), M27428 (exon 
6), M27429 (exon 7), M27430 (exon 8); thyroid hormone receptor ai: 

25 M24748, thyroid hormone receptor <x 2 : J03239; progesterone 
receptor: AF016381; somatotropin receptor: J00148; vitamin D 
receptor (calcitriol receptor): 303258. 

The skilled person will understand that expression of the 
receptor proteins can be achieved by standard methods, e.g. via PCR- 

30 cloning of the known cDNAs from cDNA libraries and overexpression of 
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the corresponding proteins in suitable expression vectors, such as, for 
example, the vectors of the present invention, in suitable host cells, 
e.g;, COS cells. Accordingly, subsequent purification of the cytosolic 
fraction can be achieved by routine methods such as affinity 
chromatography purification. For this purpose, various suitable 
antibodies against the desired receptor are commercially available. 
For example, polyclonal antibodies against the mouse progesterone 
receptor that have a sufficiently high cross- reactivity for the human 
protein are available from Dianova (Hamburg, Germany). Likewise, 
further purification can be achieved by standard methods, e.g., 
chromatographical methods such as ion-exchange chromatography 
and/or FPLC. 

The most preferred receptor is the progesterone receptor. 
Preferably, the receptor is a human progesterone receptor. Such a 
human progesterone receptor (from T47D human breast cancer cells) 
is disclosed in US Patent No. 4,742,000, and cells expressing this 
receptor have been deposited (ATCC deposit number HTB, 133). As 
already described above, it would be routine to purify such a receptor 
from the cytosol using receptor specific antibodies. In addition, US 
Patent No. 4,742,000 discloses a method for purification of the human 
progesterone receptor using a specific steroid affinity resin (cf. 
Grandics et al., Endocrinology, Vol. 110, 1088, 1982). 

Briefly, the cytosolic fraction of the T47D cells is passed over 
Sterogel, a commercial preparation of deoxycorticosterone coupled to 
Sepharose® 2B that selectively binds the progesterone receptor. After 
washing with loading buffer, the bound receptor is eluted with a buffer 
containing progesterone. The eluted steroid-receptor complex is then 
chromatographed on DEAE-Biogel and eluted stepwise with a buffer 
containing 0.2M NaCI. Subsequently, the bound progesterone can be 



WO 00/49147 



PCT/EP00/01368 



17 

readily exchanged. As described above, further purification can be 
achieved by routine methods well-known to the skilled person. 

An alternative method is disclosed in Example 5. 
The structure of the hPR polypeptide is depicted in Fig. 16. The hPR 
polypeptide is composed of distinct structural domains. Naturally the 
human progesterone receptor (hPR) is expressed as two different 
sized proteins termed hPR-B (120 kDa) and hPR-A (94 kDa). HPR-A is 
a truncated but otherwise identical form of hPR-B, that is missing 165 
the N-terminal amino acids (see Fig. 20, SEQ ID NO: 18). Both forms 
seems to be indistinguishable regarding their progesterone or DNA 
binding properties. In human cells the A and B forms of hPR are 
produced from the same gene by alternate initiation of translation at 
two different AUG start sites within the same RNA transcript. As it was 
reported earlier hPR-A and B can be expressed in Spodoptera 
frugiperda (Sf9) cells as biological fully active polypeptides 
(Christensen et a/., Mol. Endocrinol. 5, 1755ff (1991); Elliston et al., 
JBC 267, 5193-5198 (1992)). 

The carboxyl terminus of the hPR polypeptide as shown in Fig. 
16 comprises a progesterone binding domain (PBD) but also contains 
sequences responsible for the association with heat shock proteins 
and receptor dimerization. The hinge region provides a flexible link 
between the DNA-binding domain (DBD) and the PBD but is also 
thought to contain elements for receptor dimerization as well as 
nuclear localization. Binding of the hPR to its corresponding target 
sites at the chromosomal DNA (PREs, Progesterone Responsive 
Elements) is known to be mediated by the DBD. The remaining N- 
terminal trans-activation domain (TAD) consists of regions specific for 
the in vivo function of the hPR as a transcriptional gene activator. 

Even though the N-terminus also seems to contribute directly to 
the homodimerization of hPR after progesterone binding, it has been 
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demonstrated that a fragment comprising only the hinge region and 
the PBD was the minima! C-terminal fragment to mediate 
progesterone dependent hPR-hPR-interaction (Tetel et a/., Mol. 
Endocrinol. 11, m 4 ff. (1997). It is believed that genetically 
engineered hPR polypeptides lacking either in part or completely the 
TAD (amino acids 1 to 556) might be expressed as structurally stable 
and fully soluble dimers in the presence of progesterone. Complexes 
with such a truncated hPR (provided that said truncated hPR exhibits 
DNA-binding activity as well as progesterone-binding activity) may 
functionally replace the complexes with the full length form of the 
described recombinant hPR-A or hPR-B proteins, since still mediating 
the contact between the plasmid DNA and the progesterone. Thus, the 
hPR in embodiments (2) and (6) to (16) of the invention preferably is 
a PR comprising nucleic acids 557 to 933 of natural h PR shown in SEQ 
ID NO: 18. 

Effective expression of such a truncated version of hPR is 
possible in the baculovirus system but also in other eukaryotic 
expression systems, such as cultivated mammalian cells or yeast cells. 
Furthermore, also an E. coli overexpression strain is a possible 
system for the production of those polypeptides. In this case, the 
fusion of such a truncated hPR-version to a suitable polypeptide 
sequence, e.g. a histidine containing sequence or the GST (glutathion 
S- transferase) protein, might be helpful to overcome insolubility 
problems as well as to facilitate the isolation and purification of the 
expressed protein. 

Mutated versions of these receptors and derivatives thereof, 
that still retain the function of the receptors to bind a ligand and 
thereby become activated and bind DNA and regulate transcription, 
may also be employed in the invention. Such derivative may be a 
chemical derivative, variant, chimera, hybrid, analog, or fusion. 
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The third component of the gene transfer system of the 
invention is the hormone. The hormone in the agent of embodiment 2 
and in the composition of matter of embodiment (6) include synthetic 
and natural hormones, preferably steroid hormones, such as estrogen, 
5 testosterone, glucocorticoid, androgen, thyroid hormone, and 
progesterone or derivatives thereof. These are widely available. 
Progesterone is most preferred. For example, natural micronized 
progesterone is the preferred progesterone from which has been 
marketed in France since 1980 under the trademark of UTROGESTAN® 

io and is still available in Germany under the trademark UTROGEST®. Its 
properties are similar to the endogenous progesterone, in particular, it 
has antiestrogen, gestagen, slightly antiandrogen and 
antimineralocorticoid properties. The natural micronized progesterone 
in said marketed products is dispersed in a matrix as described 

15 hereinbelow. 

The above micronized progesterone has advantages that make it 
a suitable carrier for genes or nucleic acid constructs to target cells. 
Specifically, the synergistic effect of the double process of 
micronization and suspension in long-chain fatty acids residues of an 

20 oil results in increasing progesterone absorption. It has been 
demonstrated that after oral administration of 100 mg of 
UTROGESTAN®, peak plasma progesterone levels were obtained after 
1-4 hours in most cases (Padwick, M. L., et al., Fertil. Stent., Vol. 46, 
402, 1986). Later on, the levels declined substantially, although they 

25 were still elevated at 12 hours. Even at 84 hours the levels were 
slightly higher than baseline. A U.S. kinetic study confirmed earlier 
work demonstrating the bioavailability of oral micronized 
progesterone. They showed a peak effect at 2 hours followed by rapid 
decrease in plasma progesterone level (Simon, J. A., et al., Fertil., 

30 Steril., Vol., 60, 26, 1993). 
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A further advantage of using progesterone as a carrier is the low 
level of disadvantageous side effects. Orally administered 
progesterone adversely affects neither plasma lipids (Jensen, J. et aj., 
Am. J. Obstet. Gynecol., Vol. 156, 66, 1987) nor carbohydrate 

5 metabolism (Mosnier-Pudar, H. et al., Arch. Mai. Coeur, Vol 84, 1111, 
1991). Further, progesterone does not affect liver enzymes (ASAT, 
ALAT, AFOS), sex-hormone binding-globulin (SHBG) synthesis or HDL- 
cholesterol levels at daily doses of 200 mg and 300 mg. Although the 
plasma levels of deoxycorticosterone may increase substantially 

10 during UTROGESTAN® treatment, there are strong indications that the 
mineralocorticoid effects of this progesterone metabolite are 
completely counteracted by the anti-mineralocorticoid effects of 
progesterone itself. This is apparent from a comparative study 
(Corvol, P., et al., In: Progesterone and progestins. Raven Press, New 

15 York> 179, 1983) in which oral UTROGESTAN® was capable of 
antagonizing the mineralocorticoid effects of 9-a-fluorohydrocortisone. 

In the agent of embodiment (2) and in the composition of 
matter of embodiment (6) of the invention the molar ratio of HRE (or 
SRE/or PRE) within the nucleic acid construct to hormone receptor is 

20 preferably from 1:1 to 1:10, more preferably from 1:2 to 1:5. On the 
other hand, the molar ratio of hormone to hormone receptor is 
preferably at least 1000:1, more preferably at least 10000:1. Thus, 
the hormone is present in a large excess relative to the hormone 
receptor and the HRE, which is desirable in view of the ability of the 

25 hormones to transfer nucleic acid constructs through cell membranes. 

The skilled artisan will appreciate that the agent of embodiments 
(1) and (2) and the pharmaceutical composition of embodiment (10) 
may contain other components capable of assisting in introducing the 
nucleic acid into a cell for the purpose of gene therapy (matrix 

30 compounds). Specifically, the agent and the composition, especially 

_0049147A1_L> 



WO 00/49147 



PCT/EP00/01368 



the hormone component thereof, may contain the following matrix 
compounds: glucose and related compounds (such as D-sorbitol, D- 
mannitol); solubilizing adjuvants (such as alcohols, e.g., ethanol); 
polyhydric compounds such as glycerine, polyethylene glycol and 

5 polypropylene glycol; nonionic surface active compounds, ionic surface 
active compounds such as lecithin; oily compounds such as sesame 
oil, peanut oil soybean oil, corn oil, etc.; starches and their derivatives 
such as cyclodextrines and hydroxyalkylated starches; stabilizers such 
as human serum albumin, preservatives such as benzyl alcohol and 

io phenol; and the like. The preferred matrix contains 6-cyclodextrine, 
glycerine, lecithin and/or corn oil. For example, the pharmaceutical 
composition of hormone-hormone receptor nucleic acid complex of the 
invention may be provided orally to humans or animals as a gelatin 
capsule. Progesterone therein (preferably in micronized form) could 

15 be present in a concentration of 50 to 1000 mg, preferably 200 -300 
mg dissolved in a 35 % or 40 % (3-cyclodextrin solution or in cornoil 
or gycerol with peanut oil together with lecithin. 

Alternatively, when - due to the selection of appropriate matrix 
components - the pharmaceutical composition is in a pasty, gel-like 

20 form, it may be provided topically. 

The nucleic acid construct of embodiments (1) to (16) of the 
present invention may - aside from the transgene and the HREs, 
SREs, or PREs already disclosed above - further contain promoter, 
enhancer, and/or silencer sequences. The promoter may be ubiquitous 

25 or tissue-specific. Of the ubiquitous promoters, the CMV promoter is 
most preferred. However, a tissue-specific promoter is preferred over 
a ubiquitous promoter. For example, the tissue-specific promoters 
envisioned for the instant invention include ai-antitrypsin (further 
promoters). 
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The nucleic acid construct may further comprise additional 
sequences such as the ampicillin resistance gene. Other reporter 
sequences known to the skilled artisan may also be included, such as, 
for example, the green fluorescent protein (GFP), luciferase, 6- 
5 galactosidase or chloramphenicolacetyltransferase (CAT). As an 
enhancer sequence, the SV40 intron and SV40 Poly A are most 
preferred. The nucleic acid construct may further contain inducible 
promoters such as, for example, a MMTV (Mouse Mammary Tumor 
Virus) promoters inducible via glucocorticoides and Ecdyson-inducible 

io insect promoters. 

A preferred nucleic acid construct contains sequentially from the 
5' to the 3' end: a PRE, a CMV promoter, a gene of interest, SV40 
Intron and SV40 poly A enhancer sequence, and an ampicillin 
resistant gene. Further PREs are evenly distributed on the vector 

is backbone. 

The nucleic acid construct may further contain origin of 
replication sequences (especially eukariotic origin of replication 
sequences), elements for gene targeting, integrational sequences 
(e.g., AAV-ITR, transposon IS), 3'-UTR, "switch" systems (e.g., TET 

20 system, Cre/loxP or Flp/ftr system). 

The transgene may be chosen from those encoding proteins 
lacking in a variety of genetic disorders or involved in conditions 
related to inappropriate responses to hormones, for example, 
hormone-dependent cancers such as breast, ovarian, and endometrial 

25 cancers and prostate cancer. The transgene may also be used to 
replace a defective gene resulting in such genetic disorders as 
hemophilia, von Willebrand disease, and cystic fibrosis. The transgene 
includes mutations of such gene or a gene encoding a fusion product. 
The nucleic acid construct of the present invention may comprise 

so more than one transgene. 
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In particular, the transgene may replace genes for a blood 
clotting factor, and preferably a human blood-clotting factor. The 
genes encoding factor VIII and factor IX (sown in Fig. 2, SEQ ID NO: 
2), involved in hemophilia A and B, respectively, are good candidates 

5 for the invention. Other candidates include the gene encoding von 
Willebrand factor, factor IV, factor X, or protein C. 

Other useful transgenes include, but are not limited to, hormone 
genes such as the genes encoding for insulin, parathyroid hormone, 
luteinizing hormone releasing factor (LHRH), a and 6 seminal inhibins 

10 and human growth hormone; hormone receptor genes such as the 
glucocorticoid receptor, the estrogen receptor, the progesterone 
receptor, the retinoic acid receptor; growth factors such as vascular 
endothelial growth factor (VEGF), nerve growth factor, epidermal 
growth factor; enzyme genes; genes encoding cytokines or 

is lymphokines such as interferons, granulocytic macrophage colony 
stimulating factor (GM-CSF), colony stimulating factor-1 (CSF-1), 
tumor necrosis factor (TNF), and erythropoietin (EPO); genes 
encoding inhibitor substances such as ^antitrypsin, and genes 
encoding substances that function as drugs, e. g., genes encoding the 

20 diphteria and cholera toxins, ricin or cobra venom factor. Also, 
antisense sequences may be administered as genetic material. 

Another aspect of the present invention is vectors comprising 
the nucleic acid constructs of embodiment (3) of the present 
invention. These vectors may be used in the composition matter of 

25 embodiment (6) of the present invention. Preferably, however, the 
nucleic acid sequence for use in the invention is circular rather than 
linear. The vectors may be capable of expressing the nucleic acid in 
the nucleic acid construct transiently or permanently (including 
episomally). As noted above, the nucleic acid construct therein may 

30 further contain additional elements. 
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The composition of matter of embodiment (6) of the invention 
can be prepared by admixing the nucleic acid construct with the 
hormone receptor and the hormone. Preferably, an aqueous solution 
of nucleic acid construct was added to the oily suspension containing 
5 the hormone at ambient temperature under stirring. 

Embodiment(s) of the invention relates to transfected and 
transformed cells or transgenic organism comprising these vectors 
and/or nucleic acid constructs. Within the scope of this invention, a 
transfected cell is one in which foreign DNA has been incorporated. 
io Methods of transfection may include microinjection, CaP0 4 
precipitation, electroporation, liposome fusion, or gene gun. 

Transformation refers to introducing genetic material into a cell, 
such as the vectors or nucleic acid constructs of the invention, 
rendering the cell transiently or permanently altered so that the cell 
15 expresses a specific gene product or is otherwise altered in its 
expression. Transformation may be achieved by in vivo or in vitro 
techniques, although in vivo transformation is preferred. 

A further embodiment of the present invention is pharmaceutical 
compositions comprising a therapeutically effective dose of the nucleic 
20 acid constructs of the invention and a hormone. The hormone is 
preferably a steroid, and most preferably, progesterone, as described 
above. The dose is dependent on the condition to be treated, the 
characteristics of the patient, and the result sought to be achieved. 
Determining dosage is within the realm of the skilled artisan. 
25 The pharmaceutical composition (or, alternatively, the 

composition of matter, the nucleic acid construct, or the vector) of the 
present invention may be administered orally, intravenously, 
intramuscularly, subcutaneously, topically, through mucosa (including 
buccal, nasal spray) or by gene gun. Oral administration (of a 
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micronized hormone dispersion) is preferred. Delivery may be 
systemic or directed at certain tissue. 

The invention further includes a method of introducing into a cell 
a nucleic acid construct encoding a gene of interest, e.g., a human 
blood-clotting factor, to express the blood-clotting factor in the cell. In 
this method, the nucleic acid encoding a human blood-clotting factor 
is combined with a nucleic acid construct comprising at least one 
hormone responsive element (HRE), preferably a progesterone 
responsive element. 

The mixture of nucleic acid bound to the hormone-hormone 
receptor complex together with an excess of hormone, preferably 
progesterone, will be used to introduce the nucleic acid into a cell by 
various methods known to the skilled artisan and outlined above. The 
cell-uptake will be stimulated by the interaction of the hormone with 
the cell membrane. The hormone or steroid interacts with the lipid 
bilayer of the cell membrane not only through membrane perturbation 
but also through activation of certain hormone- or steroid-sensitive 
membrane receptors. This has been demonstrated for progesterone 
and other steroids. Last but not least, it is known that hormones are 
able to cross the cell membrane by diffusion. In the present invention, 
the nucleic acid bound to the hormone-hormone receptor complex 
should be transported through the membrane during the process of 
diffusion or uptake. 

Another aspect of the invention is a method of treating a blood 
clotting disorder by administering a therapeutically effective amount 
of the composition of matter of the invention to an organism. This 
method involves the administration and dosage considerations already 
discussed. 

Embodiments (17) and (18) of the invention pertain to the use 
of a steroid hormone for preparing an agent for gene therapy and/or 
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gene transfer and to method for gene therapy and/or gene transfer 
which comprises administering a nucleic acid construct to an organism 
or to a cellular system, wherein the nucleic acid construct contains a 
transgene and is encapsulated in a steroid hormone. Suitable steroid 

5 hormones are enumerated hereinafter. The preferred steroid hormone 
in said embodiments of the invention is a natural micronized steroid 
hormone, in particular a natural micronized progesterone. In a 
preferred embodiment, the micronized hormone is 
solubilized/dispersed in a lipophilic matrix as described hereinafter. 

10 Experiments have been performed to illustrate the technical 

aspects of the present invention. These experiments are described in 
examples 1 to 9 below. The skilled artisan will be readily recognize 
that the invention is not limited to these examples. 



15 Examples 

Example 1: Construction of Vectors 

Production of the vector pTGFGl: The vector pUC19 (MBI Fermentas) 
20 was digested with Xbal, treated with Klenow enzyme and religated. 
This Xbal deleted vector was then digested with EcoRI, treated with 
Klenow enzyme and religated in order to delete the EcoRI site. For 
insertion of a Xbal site in the SacI site of this vector it was digested 
with SacI, treated with T4-polymerase, dephosphorylated with alkaline 
25 phosphatase and ligated with the Xbal-linker CTCTAGAG (Biolabs 
#1032). Another Xbal-site was inserted by digesting the newly 
produced vector with Hindlll, treating it with Klenow, 
dephosphorylating it with alkaline phosphatase and ligating it with the 
Xbal-linker CTCTAGAG (Biolabs #1032). This vector was named 
30 pUC19/X. 
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In order to destroy the Xbal-site present in the vector phGFP- 
S65T (Clontech) this vector was digested with Xbal, treated with 
Klenow enzyme and religated resulting in the vector pGFP/0. A 2.3 kb 
fragment containing the GFP-Gene was isolated after digesting pGFP/0 
with Mlul, treating it with Klenow enzyme and digesting it with BamHI. 
This fragment was inserted into the multiple cloning site of the vector 
PUC19/X which was digested with Sail, treated with Klenow enzyme 
and digested with BamHI. The resulting vector was named pTGFGl 
(Figure 2). 

Starting with this vector all the vectors described in Table 1 
were obtained. At the restriction sites for PstI, Kpnl, Ehel, EcoO109 
and/or Sapl a PRE(ds) was inserted giving rise to plasmids carrying 
the GFP gene and up to five PREs. By exchanging the GFP gene with a 
FIX gene a set of FIX expression plasmids were obtained. By excising 
the GFP gene the cloning vectors without a transgene were obtained. 

Production of the insert PRFf ds )- The oligonucleotides (Metabion) 
PRE-S (5'-GGG GTA CCA GCT TCG TAG CTA GAA CAT CAT GTT CTG 
GGA TAT CAG CTT CGT AGC TAG AAC ATC ATG TTC TGG TAC CCC-3'; 
SEQ ID NO: 3) and 

PRE-AS (5*-GGG GTA CCA GAA CAT GAT GTT CTA GCT ACG AAG CTG 
ATA TCC CAG AAC ATG ATG TTC TAG CTA CGA AGC TGG TAC CCC-3'; 
SEQ ID NO: 4) 

were hybridized and phosphorylated by kinase reaction, resulting in 
the insert PRE(ds). 

Production of the vector nTCFf^- The vector pTGFGl was digested 
with EcoO109I, treated with Klenow enzyme and dephosphorylated 
with alkaline phosphatase. It was then ligated with the PRE(ds) insert, 
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resulting in the vector pTGFGS (Figure 3), i.e., a vector which carries 
a PRE at position C of Fig. 2. 

Production of the vector DTGFG20: The vector pTGFGl was digested 
5 with Kpnl, treated with T4-polymerase and dephosphorylated with 
alkaline phosphatase. It was then ligated with the PRE(ds) insert, 
resulting in the vector pTGFG7. This vector pTGFG7 was digested with 
PstI, treated with T4-polymerase and dephosphorylated with alkaline 
phosphatase. It was then ligated with the PRE(ds) insert, resulting in 
io the vector pTGFGll. Subsequently, pTGFGll was digested with 
EcoO109I, treated with Klenow enzyme and dephosphorylated with 
alkaline phosphatase. It was then ligated with the PRE(ds) insert, 
resulting in the vector pTGFG20 (Figure 4). This vector carries a PRE 
at positions A, B and D of Fig. 2. 

15 

Production of the vector PTGFG33: In a similar manner PRE(ds) were 
inserted at the restriction sites for PstI, Kpnl, Ehel, EcoO109 and Sapl 
in vector pTGFGl giving rise to the plasmid pTGFG33 (Figure 5), 
which is a vector that carries the GFP gene and five PREs, one each in 
20 position A, B, C, D, E (Figure 2). 

Production of the vectors pTGFG 36 , DTGFG53 and DTGFG64: The 
vector pUC19 (MBI Fermentas) was digested with Sail, treated with 
Klenow enzyme and dephosphorylated with alkaline phosphatase. It 

25 was ligated to the Notl-linker GCGGCCGC (Biolabs.# 1045), resulting 
in the vector pUC19/N. 

A 1.4 kb fragment containing the open reading frame of the 
human clotting factor IX, isolated from a human cDNA library (see 
example 2), was inserted into the Pstl-site of the vector pUC19/N 

30 which was digested with PstI, treated with T4-polymerase and 



_0O49147A1_l_> 



WO 00/49147 



PCT/EP00/01368 



29 

dephosphorylated with alkaline phosphatase. From the resulting 
vector pUC19/N-FIX a 1.4 kb fragment containing the open reading 
frame of the human clotting factor IX was cut out by double-digestion 
with Hind III and Notl. This fragment was ligated to the 4.3 kb 
fragment of the Hindlll and Notl double-digested vector pTGFG5 
resulting in the vector pTGFG36 shown in Figure 6. This vector is a 
preferred one for delivery of Factor IX into the cell, and its DNA 
sequence is provided in Figure 9 (SEQ ID NO: 1). 

In a similar manner plasmids pTGFG53 and pTGFG64 (shown in 
Figures 7 and 8) were obtained by exchanging the GFP gene in 
plasmids pTGFG20 and pTGFG33 by the FIX gene. 
Produ ction of the insert ALLGfds^: The oligonucleotides (Metabion) 
ALLG1/1 (5'-AGC TTG ACC TCG AGC AAG C-3') (SEQ. ID NO: 5) and 
ALLG2 (5'-GGC CGC TTG CTC GAG GTC A- 3') (SEQ. ID NO: 6) were 
hybridized and phosphorylated by kinase reaction, resulting in the 
inserts ALLG(ds). The insert ALLG (ds) was constructed to introduce 
into the vector of choice a sequence with a multiple cloning site for 
the possible introduction of other transgenes. 

Table 1 gives an overview of the available vectors with different 
transgenes and a different number of PREs in various positions. The 
positions of the PREs are given according to Figure 2. For the 
underlined vectors a map is provided (Figures 3 to 8). 



Table 1: Vectors of the invention 



Plasmid 


Trans- 
gene 


PRE 


Plasmid 


Trans- 
gene 


PRE 


Plasmid 


Trans- 
gene 


PRE 


pTGFGO 






pTGFGI8 


GFP 


BDE 


pTGFG34 


FIX 


E 


pTGFGI 


GFP 




pTGFG19 


GFP 


BCD 


pTGFG35 


FIX 


A 


pTGFG2 


FIX 




DTGFG20 


GFP 


ABD 


PTGFG36 


FIX 


D 


pTGFG3 


GFP 


E 


pTGFG21 


GFP 


CDE 


pTGFG37 


FIX 


C 



\ 
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pTGFG4 


GFP 


A 


pTGFG22 


GFP 


ach 

/V V^- YJ 


p 1 ^jtCjjo 


FIX 


B 


DTGFG5 


GFP 


D 


pTGFG23 


GFP 


ABC 


P 1 v.! r Li j j 


FIX 


ABD 


pTGFG6 


GFP 


C 


pTGFG24 


GFP 


ABE 


p I Vjrtjo4 


FIX 


ABCDE 


pTGFG7 


GFP 


B 


pTGFG25 


GFP 


ACE 


p I UrUOo 




A 


pTGFG8 


GFP 


BC 


pTGFG26 


GFP 


a DP 


p 1 uruo / 




D 


pTGFG9 


GFP 


BE 


pTGFG27 


GFP 


BCF 


p 1 LirLiOo 




C 


pTGFGlO 


GFP 


BD 


pTGFG28 


GFP 


BCDE 


pTGFG69 




B 


pTGFG 1 1 


GFP 


AB 


pTG FG29 


GFP 


ACDE 


pTGFG82 




ABD 


pTGFG13 


GFP 


CD 


pTGFG30 


GFP 


ABCE 


pTGFG95 




ABCDE 


pTGFG14 


GFP 


AC 


pTGFG31 


GFP 


ABDE 








pTGFG15 


GFP 


DE 


pTGFG32 


GFP 


A BCD 








pTGFG16 


GFP 


AD 


DTGFG33 


GFP 


ABCDE 









For the DNA sequence of pTGFG 36, pTGFG 53, pTGFG 64, pTGFG 67, 
pTGFG 82 and pTGFG 95, see SEQ ID NOs: 1 and 13 to 17, 
respectively. 



Example 2: Isolation of Human Factor IX cDNA 



Factor IX cDNA was amplified from human liver cDNA (Clontech) 
using two primers overlapping the start and termination codon of the 
factor IX open reading frame resulting in a 1387 bp fragment 
containing the entire open reading frame. Restriction sites for EcoRI 
(upstream) and BamHI (downstream) were included at the end of 
each primer to facilitate cloning. Amplification was performed with 
Pwo polymerase (Boehringer Mannheim) in 50 ml reaction volume [10 
mM Trls HCI P H 8 -85, 25 mM KCI, 5 mM (NH 4 ) 2 S0 4 , 2 mM MgS0 4 ] 
with 30 incubation cycles at 96°C for 1 min, 60°C for 1 min, 72°C for 
2 min, followed by a final extension step at 72°C for 10 min. 

Reaction products were ligated into the EcoRI- and BamHI-sites 
of pUC19 and transformed into E. coli DH5-a. Positive clones were 
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selected. Sequences were confirmed by cycle sequencing (Amersham) 
from both ends with labeled primers (IR-700) and automated analysis 
on the LiCor sequencing system (MWG, Biotech). 
The following primers were used : 

5 GGAATTCCGCAAAGGTTATGCAGCGCGTGAACATGATCATGGC 
(upstream; SEQ. ID NO: 7) 

CGCGGATCCATTAAGTGAGCTTTG I I I I I I CCTTAATCC (downstream ; 
SEQ. ID NO: 8) 

10 Example 3: Expression and Quantification of the Marker 
Protein GFP 

Method 1: HeLa cells were transfected by electroporation with 
plasmids pTGFG5 or pTGFG20. Transfected cells were harvested and 

is the cell pellets were homogenized and lysed in a buffer containing 
phosphate buffered saline (pH 7.5) and 10 mM PMSF. The 
concentration of green fluorescent protein (GFP) in the cell 
homogenate was determined by competitive ELISA. 

For this purpose, GFP was coated in a defined concentration on 

20 microtiter plates. Then, GFP samples were added in the presence of 
anti-GFP antibody. After several washing steps a labeled secondary 
antibody was added in order to trace the first antibody. The 
cblorimetric reaction was measured photometrically (extinction). 
Generally, the more GFP was added the less antibody was left to bind 

25 the coated GFP. Thus, reduction of extinction corresponded to higher 
GFP concentration in the sample. 

A concentration curve of GFP was determined by linear 
regression (Figure 11) using bovine serum albumin (BSA) as a 
reference. A mean value of 2.4 mg GFP/ml for pTGFG5 (1 PRE) and 

30 5.2 mg GFP/ml for pTGFG20 (3PREs) was found. 
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Figures 12 a-d show micrographs of HeLa cell cultures 
transfected with pTGFG5 (Fig. 12 a and b) and pTGFG20 (Fig. 12 c 
and d), respectively. Figures 12 a and c represent light microscopic 
views as controls, and Fig. 12 b and d show the corresponding cell 
5 patches in the fluorescent mode. Routinely, more than 50% of the 
cells expressed GFP, indicating very efficient transfection, the 
presence of only one PRE showing more efficient expression. 

Method 2: 293 T cells were transfected with pTGFG 5, 20 and 33 
io using calcium phosphate method and fluorescence was detected with 
a fluorimeter (Labsystems, Extinction: 485 nm Emission: 520 nm). In 
the case of the mock transfection, non GFP-expressing DNA was used. 
Background indicates the fluorescence of the empty plate (96-well 
plate, Dynex, Immulon-4). The results are summarized in Fig. 13. 
15 Again the vector with just one PRE (pTGFG5) shows the highest 
expression. 

Example 4: Human Factor IX Quantification by ELISA Assay 

20 HeLa cells were transfected either by electroporation or using 

liposome reagent DOTAP (Boehringer Mannheim) with plasmids 
PTGFG36, pTGFG53 and pTGFG64. These plasmids contain the cDNA 
of human clotting factor IX. Recombinant human factor IX was 
secreted into the supernatant of the cell culture and quantified using a 

25 sandwich ELISA method. 

0.11 M sodium citrate and 10 mM PMSF were added in order to 
prevent degradation of human factor IX. The enzyme-immunological 
in vitro assay "Asserachrom IX:AG" from Boehringer-Mannheim was 
used in order .to determine the concentration of expressed human 
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clotting factor IX. The factor IX-standard from Octapharma AG was 
used as a standard in aqueous solutions of 28 IU/ml. 

In six different transfection experiments, in which HeLa cells 
with piasmids containing human factor IX-cDNA (pTGFG36, 53 and 
5 64) were transfected using either electroporation or lipid-transfection 
reagent (DOTAP, Boehringer Mannheim), a concentration range of 3- 
25 ng/ml human clotting factor IX was reached. 

Example 5: Production and Purification of hPR (A Form) 

10 

1. Cloning of the human progesterone receptor: The cloning was 
performed as follows: Total human RNA was isolated from human 
white blood cells or liver cells using cell lysis in guanidinium 
hydrochloride buffer and CsCI-density centrifugation. 
15 For cloning of the hPR coding sequence, hPR specific cDNA was 
prepared and used for amplification of the hPR coding sequence in two 
fragments by PCR. 

The following oligonucleotide primers were selected based on the 
published mRNA sequence (Genbank: NM_000926 and X51730). 
20 Oligonucleotides used were obtained from MWG, Ebersberg or 
Metabion, Munchen. All primers used are listed 5' to 3', bases added 
to introduce restriction sites are in capital letters and restriction sites 
used for cloning are underlined. 

hPGR-5'-primer: CGA GGA tec agt cgt cat gac tga gc (SEQ ID NO: 9); 
25 hPGR-3'-primer: GCA GAA TT c at tat aaa aac tea aga cct cat aat cct 
gac (SEQ ID NO: 10); 

hPGR-internal primer (Sal I) 1: etc etc ggg gtc gac cct gg (SEQ ID NO: 
11); 

hPGR-internal primer (Sal I) 2: cca gg g teg ac c ccg agg ag (SEQ ID 
30 NO: 12). 
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Synthesis of cDNA was perfumed using 3 M9 of total RNA and 200 
pmo 0 „ he 3 , primer with SuperScr|pt n reverse transc . ptase 

ReaCti ° n volume was 50 pi and buffer was used as 
recommended, supplemented with RNase Inhibitor and 10 m M DTP 
and 1 m M dNTPs. Before adding the enzyme, sampies were heated to 
80 C for 10 min, followed by 10 mm at 72°C and 10 min at 42°C 
Superscript II RT was added at 42°C and reaction was continued for 
IS mm at 42°C, 15 min at 50°C and 1 h at 58°C. 

The CDMA obtained from this synthesis reaction was used to ampiify 
he hPGR coding sequence in two fragments by PGR. One fragment 
(5 ) w,th 5 -primer and internal primer 2 and one fragment (3') with 3' 
pnmer and internal primer 1. Reaction setup in SO pi was : Pwo 
polymerase (Roche Diagnostics,, buffer as supplied by Roche 
Diagnostics, supplemented with DMSO, 50 pmol of each primer and 
0.2 mM dNTPs. Reaction conditions were.- 10 min 96°C followed by 35 
cycles of 1 min 96°C, 2 min at 59°C, 2 mm 72-C and a final extension 
step at 72°C for 10 min. 

PCR-products were purified by ge, electrophoresis and digested with 
Sai l. The BamHI and Hind III sites introduced in the primer were not 
used to avoid cutting at two internal restriction sites of the h PR coding 
sequence. Both fragments were ligated into pBluescript SK+ vector 
cut with EcoRV through Hunt end ligation into the vector and sticky 
end hgation through the Internal Sal I site. Vectors containing the 
appropriate insert were identified by mini-prep, restriction digest and 
sequencing. The obtained vector was designated pTGhPRi 
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2. Production of hPR (A-forrn^: Initially, the gene for hPR-B inclusive 
its 3'-UTR was cut out from pTGh PR1 and cloned in frame in the 
multiple cloning site of the expression plasmid pFASTBAC HTc (BAC- 
to-BAC Baculovirus Expression System, Life Technologies). This 

5 resulted in an expression casette of a N-terminally histidine-tagged 
version of hPR-B under expression control of the viral polyhedrin 
promotor as shown below. A rTEV protease cleavage site is located 
between the six histidine residues and the initial methionine of the 
hPR-B reading frame, which allows removal of the histidine residues 

10 from the expressed protein. The N-terminal region of the expression 
cassettes is shown below. 

M S Y YHHHHHH D Y D I P T T E N L Y FQ * * G AMG I RN S T - hPR- gen 
. 6 x His .. ... 
•5 spacer . V ■ 

rTEV cleavage site 

Amino acids are presented in the single letter code. The cleavage site 
of the rTEV protease is represented by ** 

20 

In order to generate the expression cassette for the truncated hPR-A 
form, the DNA sequence encoding for the amino acids between Met 1 
and Met 165 of the hPR-B form was removed using a PCR-based 
strategy. Two primer pairs were designed which allowed amplification 

25 of either a DNA fragment just downstream of the start AUG of the 
hPR-B gene and a DNA-fragment just upstream of the AUG coding for 
Met 165, respectively. In a subsequent PCR reaction these two DNA 
fragments were annealed to each other at their homologous B'-ends, 
and amplified using the outermost amplification primers. The resulting 

30 DNA-fragment was digested by EcoRI and Mlu I and the cleavage 
product was exchanged against the corresponding fragment of the 
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hPR-B expression cassette in the pFASTBAC HTc vector. Thereby the 
reading frame coding for an N-terminal histidine tagged version of the 
hPR-A polypeptide (94kDA) was restored. 

This 6xHis-tag was utilised for affinity purification of the protein 
5 by immobilized cobalt 2+ affinity chromatography on a TALON® resin 
(Clontech). The procedure, following the method of 
Boonyaratanakornkit et al. Mol. Cell. Biol. 18, 4471 (1998), was as 
follows (all steps were carried out at 0 to 8°C): 

Sf9 cells were cultivated in monolayer culture in serum free SF900 
10 medium. Viral infection of the cells was done at a multiplicity of 
infection (MOI) of 5-8. 

The harvesting was done 48 hours after infection with baculovirus 
containing the hPR expression cassette and lysed mechanically by 
homogenising in buffer A containing 20 mM Tris-CI pH 8.0, 350 mM 

is NaCI, 10 mM imidazol, 5% glycerol and a cocktail of proteinase 
inhibitors (Complete™ EDTA-free, Roche Diagnostics, Penzberg, 
Germany). After a 10 min centrifugation at 10000 x g, supernatant 
originating from 10 8 cells was incubated for 1 h with 0,5 ml settled 
TALON® resin equilibrated in buffer A. TALON® was washed with 20 

20 volumes of buffer A. hPR-A was eluted with 10 Vol buffer B, containing 
all ingredients of buffer A, but 100 mM imidazol. The eluate was 
concentrated 50-fold and dialysed against 100 volumes buffer C (PBS 
+ 100 nM progesteron) by centrifugal ultrafltration at a molecular 
exclusion size of 10 kDa (Centricon Plus-20 PL- 10, Millipore, Eschborn, 

25 Germany). 

3. Deter mination of identity, purity and yield of hPR-A: Purity and 
yield of the product were determined by application on denaturing 
reducing polyacrylamid- geleiectrophoresis according to Laemmli, U. 
so et al., Nature 227, 680-685 (1970) and subsequent staining with 
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coomassie* blue R250. By this one-step procedure hPR-A was 
enriched to a final specific hPR content of 0.2 - 0.5 mg hPR/mg 
protein. As depicted in Figure 15, lane A, the final preparation 
consisted predominantly of two distinct protein species displaying 
apparent molecular masses of 94 and 74 kDa (Fig. 15, arrows). 

Yield was estimated by parallel separation of standardised 
protein preparations. Data taken from a set of three separate 
experiments hint at a typical yield of 30 pg enriched hPR A-receptor 
per 10 8 cells. 

Identity of hPR was determined by immunodetection of the 
product transferred to nitrocellulose by western blotting with mouse 
monoclonal antibodies directed against recombinant hPR (PR Ab-1, 
Oncogene, Cambridge, MA, USA). 

the final product was transferred to nitrocellulose BA-83 and 
immunostained as described above. As presented in Figure 15, lane C, 
three major protein bands were detected, including the two dominant 
protein species described above. The smaller sized bands may display 
copurified proteolytic fragments of hPR. 

Intracellular GFP from adherent cells was detected by a 
fluorimeter after media was taken off and PBS (colourless) was added. 
The results are summarized in Fig. 13. 

Example 6: Clotting Activity of Human Clotting Factor IX from 
Transfected 293 T Cells 

A concentration range of 55 - 95 ng/ml human clotting factor IX has 
been reached by transfection of 293 T-cells with plasmids containing 
human factor IX-cDNA (pTGFG 36, 53, 64 and 2) in 11 different 
experiments using ELISA (Example 4). 

Clotting activity was deterined with a partial thromboplastin time 
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assay using Cephalin (phosphatidyl ethanoiamine) activation with a 
manual coagulation instrument (ML-2, Instrumentation Laboratories). 
For the study, 100 pi undiluted supernatant from transfected 293 T- 
cells, 100 pi deficiency plasma (Progen) and 100 pi Cephalin 
5 (Instrumentation Laboratories) were incubated for 5 minutes at 37°C. 
Coagulation was started by adding 100 pi CaCI 2 . Sample coagulation 
time was compared to normal plasma. 



Number of cells 
[/ml] 


Factor IX-concentration 
[ng/ml] 


Clotting time 
[s] 


2,1 x 10 5 


36 


L 45 


8,7 x 10 5 


■ 20 


79 



10 Normal plasma: 

Factor IX deficient plasma: 

Example 7: Analysis of an Additive Effect of Human Clotting 
Factor IX on the Clotting Time of Mice Blood 

15 

1. Clotting time: Clotting activity was determined with a partial 
thromboplastin time assay using Cephalin (phosphatidyl 
ethanoiamine) activation with a manual coagulation instrument (KC 4 
A, Amelung). 

20 For the study, 5 pi mouse blood, 20 pi deficiency plasma (Progen) ad 
100 pi physiological NaCI and 100 pi DaPPTin (Progen) were incubated 
for 2 minutes at 37°C. Coagulation was started by adding 100 pi 
CaCI 2 . 

To analyse the additive effect, human clotting factor IX 
25 (housestandard, Octapharma) was added to the mouse blood and 
diluted 1:10 within the system. As it is shown in Figure 15, the 
additive effect of human clotting factor IX on clotting activity can be 
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detected up to a limit concentration of 0,07 mlU hFIX/ml (= 31,5 
ng/ml). 

2. ELISA: The addition of human clotting factor IX to the mouse blood 
55 was monitored by ELISA as described in Example 4. Citrate plasma 
was made out of mouse blood and human clotting factor IX was added 
in different concentrations. 



No. 


Description 


Concentration 
[mlU/ml] hFIX added 


Extinction at 405 
nm [-] 


1. 


Mouse Citrate Plasma 


7 


0,204 


2. 


Mouse Citrate Plasma 


2 


0,130 


3. 


Mouse Citrate Plasma 




0,099 


4. 


Control: 1.+2. Antibody 
without antigen 




0,096 


5. 


Control: 1. Antibody 
without antigen 




0,072 


6. 


Control: 2. Antibody 
without antigen 




0,085 


7. 


Substrate (ABTS) 




0,072 



io Mouse plasma without the addition of human clotting factor IX showed 
an extinction of 0,099 at 405 nm background. When added human 
factor IX in a concentration of 2 mlU/ml (= 9 ng/ml human factor IX) 
the detection limit is reached. It can be deduced that the antihuman 
factor IX antibodies used in the ELISA are not cross-reactive with 

15 mouse coagulation factor IX. 
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Exampie 8: Cloning and Activity Testing of the Human 
Progesterone Receptor (hPR) 

2. Activity Ieaina : The human progesterone receptor encoded in 
plasmid pTGhPRl (s. Example 8.! above) was tested for its 
phyiological activity. In a functional form and after activation with a 
progestin like R5020 the receptor should be able to induce the 
expression of luciferase from a Mouse Mammary Tumor Virus (MTV) 
promoter. 

To test this 293T cells were grown in phenol red-free DMEM 
supplemented with 10% charcoal-filtrated fetal calf serum and with or 
without 10 nM of R5020 (NEN) in 6 well plates. Transfections were 
performed by the calcium phosphate method using 2 ug of a pSG- 
hPRl construct and pMTV-luc (Hollenberg et al., 1985, Cell 55 p899- 
906) per well. One day after transfection the cells were washed in PBS 
and the luciferase expression assayed with the Berthold luciferase kit 
according to the manufacturer's directions in a fluorimeter 
(Labsystems). The controls were as follows: R5020 was omitted 
(PR+MTV) and both plasmids alone were transfected with (PR+R5020 
MTV+R5020) and without R5020 (PR, MTV). As positive control a 
Plasmid with a CMV-driven luciferase gene was transfected (pCMV- 
luc). VH 

As can be seen in Figure 19, there is a clear induction of 
luciferase expression when all the necessary elements are present 
that ,s human progesterone receptor, progestin R5020 and the M TV- 
driven luciferase gene (PR+MTV+R5020). The error bars give the 
standard deviation of a threefold experiment, the readout is relative 
light units (RLU). 
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Example 9: Oral Gene Transfer in in vivo Animal Experiment 

Purpose of experiment- The object of this pilot study is to prove oral 
gene transfer in an in vivo animal experiment. Successful gene transfer 
is established by coagulation measurement: an additive effect of 
expressed human factor IX on the coagulation time of healthy murine 
whole blood is expected. The presence of expression of human factor IX 
in mouse blood is quantitated by ELISA. 



10 



15 



20 



Animals: The animals employed are 35 male C57BL/6J mice from Iffa 
Credo, France, with an initial age of 9 weeks and a weight of 23-33 g. 
The mice are kept in groups of 7 animals each in conventional test 
animal cages with wooden chips in the Institut fur Experimented 
Onkologie und Therapieforschung der Technischen Universitat 
Munchen. 

The animals are fed ad libitum with "Altrum Ratten und Mause 
Haltung" and are given tap water, also ad libitum. 

The test animal cages are kept at an ambient temperature of 19- 
24°G and a humidity of 55 5%. The room is additionally provided with 
an automatic light supply which maintains a 12 hours rhythm. 
The test animals are supervised by specialized staff. 







Group 


i-iormone 


Hormone 
receptor 


Plasmid 


Aqua dest. 


Route of 
administration 


1. - 

2. - 

3. - 

4. - 

5. - 


100 pi 
100 pi 


4.35 pg 


10 pg 
10 pg 
10 pg 
10 pg 


100 pi 
50 pi 


oral 
oral 
i.m. 
oral 
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Plasmid and hPR: Theragene GmbH 

Hormone: Utrogest 0 by Dr. Kade/Besins Pharma GmbH, 

Rigistr. 2, D-12277 Berlin 
Aquadest.: Aqua ad injectabilia Delta-Pharma GmbH, 72793 

5 Pfullingen 

Esophageal sound: Vein catheter, diam. 0.5 x 0.9 mm, 

Lot 7077 G2221, B. Braun Melsungen AG, Western 

Germany 

i.m. injection: Micro-Fine 12.7 mm, Becton Dickinson GmbH, 
io Tullastr. 8-12, D-69126 Heidelberg 

Course of experiment: The 35 mice were divided into 5 groups of 7 
mice each. One group serves as a control, the second group was daily 
administered a total of 100 pi of hormone and plasmid via the gastro- 
intestinal tract orally with an esophageal sound, the third group was 
is daily administered a total of 100 pi of plasmid with aqua dest. orally 
with an esophageal sound, the fourth group was administered a total of 
50 pi of plasmid with aqua dest. i.m. into . the musculus quadriceps 
femoris, the fifth group was daily administered a total of 100 pi of 
hormone, hormone receptor and plasmid orally with an esophageal 
20 sound. 

About 2-3 hours before the manipulation, . the mice were 
prewarmed under a red light. Immediately before, during and after the 
manipulation, the mice were examined and supervised by a 
veterinarian. 

25 Blood sampling from the mice was performed daily from the 

caudal artery of animals slightly sedated by inhalation anesthesia. For 
this purpuse the artery was punctured with a disposable injection 
cannula (0.90 x 40 mm). Whole blood welling out of the puncture site 
(5 pi of blood) was immediately collected with an Eppendorf pipette. 

30 Without further delay, the blood coagulation time in seconds was 
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determined using an Amelung-Koagulometer KC 4A by means of an 
aPTT assay (activated partial thromboplastin time). The blood 
coagulation analysis was always performed by the same person. 
Immediately after the blood sampling, the bleeding was stopped by 
5 compression. 

Sedation of the mice was achieved by inhalation anesthesia 
(active substance: isoflurane: Forene , Abbott GmbH, 65205 
Wiesbaden, Western Germany) in a whole body chamber. 
The daily manipulation was performed through an overall period of 7 

10 days. This was followed by a day (day 8 of experiment) without any 
manipulation, and at day 9 of experiment, again 5 pi of whole blood 
was withdrawn from the ventral caudal artery under anesthesia, and 
the coagulation time established as described above. Further, 0.5- 
0.75 ml of whole blood was collected intracardially using U-40 insulin 

15 syringes (Mikro-Fine 12.4 mm) filled with 50-75 ul of sodium citrate 
(3.1%), transferred into Eppendorf cuvettes, and about 100 pi of whole 
blood with citrate was reserved for PCR examination and stored in a 
cool environment. The remaining citrate blood was centrifuged for 
10 min using a centrifuge 6000 rpm, 4°C, at 5000 rpm, and the plasma 

20 was recovered for the ELISA determination of the factor IX 
concentration. 

Then, the animals were sacrificed using 0.5 ml Narkoren i.p. 
Immediately after the sacrificing, the animal bodies were dissected. 
The following organs were removed from the mice for an 
25 immunohistochemical examination: brain, spleen, liver, kidneys, testes, 
lungs, m. quadriceps femoris, heart, appendix; and frozen at -80°C. 

Deviation from the scheduled experimental course: Due to the poor 
general condition of the mice in the course of the long-term 
30 administration series, the administration had to be interrupted at days 
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3 (except one mouse) and 5 for test group 2 (hormone and piasmid), at 
days 3 and 5 for group 5 (hormone, hormone receptor and piasmid) 
and two mice were additionally spared the administration of the 
reagents at days 2 and 7 of the experiment. 

The poor general condition is accounted for by the hypnotic effect 
of the hormone progesterone. It causes the mice to sleep for about 24 
hours without eating and drinking. This again has an adverse effect on 
the water balance of the mice, resulting in exsiccotic phenomena and 
apathic behavior. Therefore, the mice were prophylactically treated with 
a subcutaneous administration of 1 ml of 5% glucose solution (Delta 
Pharma GmbH, 72793 Pfullingen) and 1 ml of Ringer solution (Delta 
Pharma GmbH, 72793 Pfullingen) when the hormone was administered 
orally. Among the group which was orally administered hormone, 
hormone receptor and piasmid, two mice died at days 3 and 6, 
respectively; they were dissected. 

Among the group which was orally administered hormone with 
piasmid, one mouse was found dead in its cage on day 8 of the 
experiment; it was also dissected. m 

The results are summarized in Figures 17 and 18. The statistical 
evaluations were performed according to the generalized linear model 
with repeated measurements (MANOVA with repeated measurements). 
In none of the test groups a non-linear course was observed. 
Therefore, the course was calculated by a simple representation of the 
linear increase or decrease, namely initial value minus final value per 
mouse. The particularly interesting difference between the control and 
the group "piasmid in the hormone with hormone receptor" (group 5) 
was examined using a T test for independent random samples. 

Figure 17 shows the mean values of the calculated differences: In 
the control, for example, this difference was about 50 seconds. The 
vertical lines show plus and minus one standard deviation from these 
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values. The T test is based both on the differences between the mean 
values and on the degree of overlapping which can be seen from these 
lines: The larger the overlapping, the less is the significance of the 
mean value differences. Thus, the groups "control" and "plasmid and 
water i.m." (groups 1 and 5, respectively) are distinguished in a purely 
numerical way in the mean value, but the degree of overlapping is so 
high that these groups are not significantly different. 

The only significant difference was between group 1 and 5: The 
decrease of the latter is significantly higher than that of the control (T = 
-2.357; d.f. = 12; p < 0.05). 

The following Tables contain the concluding statistics and the 
results of the statistical tests (T test) performed on the differences 
between the mean values obtained in the course of the test: 



Group statistics 



ADMIN 


N 


mean value 


standard 
deviation 


standard error of 
the mean value 


DIF control 


7 


47.3857 


58.9946 


22.2978 


Hormone, hormone receptor 
and plasmid orally 


7 


114.7571 


47.3300 


17.8891 



20 
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Test for independent random samples 





Levene test for 






T test for equal mean values 








equal variance 


















F 


Signifi 
cance 


T 


df 


sig. (2-sided) 


mean 
difference 


standard error 
of difference 


95% confidence interval 
of difference 


















lower 


upper 


DIF variances are 


0.026 


0.874 


•2.357 


12 


0.036 


•67.3714 


28.5869 


-129.6570 


-5.0858 


equal 




















Variances 
are not equal 






-2.357 


11.461 


0.037 


-67.3714 


28.5869 


•129.9833 


-4.7596 



The human F IX was also detectable in the treated mice of the 
"hormone-hormone reception and plasmid orally group using an Elisa 
5 as described in Example 4. 
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Claims 

1. Use of a nucleic acid construct comprising at least one hormone 
responsive element (HRE) and a transgene for preparing an agent for 
gene transfer. 

2. The use of claim 1, wherein the at least one HRE is functionally 
linked to the transgene or not. 

3. The use of claim 1 or 2, wherein the transgene is selected from the 
group consisting of genes encoding a blood clotting factor, hormone 
genes, hormone receptor genes, growth factors, enzyme genes, genes 
encoding cytokines or lymphokines, genes encoding inhibitor 
substances, genes encoding substances that function as drugs or 
vaccines, and antisense sequences. 

4. The use of claim 3, wherein the transgene is a gene encoding a 
blood clotting factor and the agent is suitable for treating hemophilia. 

5. The of claim 4, wherein the human blood clotting factor is selected 
from the group consisting of factor VIII, factor IX, and von Willebrand 
Factor (vWF). 

6. The use of claims 1 to 5, wherein the nucleic acid construct 
comprises 1 to 20, preferably 3 to 10 HRE(s). 

7. The use of claim 1 to 5, wherein the at least one HRE is a steroid 
responsive element, preferably a progesterone responsive element 
(PRE). 
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8. The use of claim 5, wherein the HRE is a PRE and the blood clotting 
factor is factor IX, preferably the factor IX has a nucleotide sequence 
of 689 to 2071 of SEQ ID NO: 1. 

9. The use of claim 5, wherein the HRE is a PRE and the blood clotting 
factor is factor VIII. 



10. The use of claim 7 to 9, wherein the PRE has the double stranded 
DNA sequence comprised of the DMA sequences of SEQ ID NOs: 3 and 

io 4. 

11. The use of claims 1 to 10, wherein the construct further comprises 
functional DNA sequences selected from the group consisting of 
promoter sequences, enhancer sequences, silencer sequences, origin 

is of replication sequences, integrational sequences, marker genes and 
switch sequences. 

12. The use of claim 11, wherein the construct further comprises a 
tissue-specific promoter, preferably an a-antitrypsin promoter. 

20 

13. The use according to any one of claims 1 to 12, wherein the agent 
further comprises a hormone-hormone receptor complex, preferably a 
steroid-steroid receptor complex. 

5 14. The use of claim 13, wherein the molar ratio of HRE within the 
nucleic acid construct to hormone receptor is from 1:1 to 1:10, 
preferably 1:2 to 1:5, and/or the molar ratio of hormone to hormone 
receptor is at least 1000:1, preferably at least 10000:1. 

o 15. The use of claim 13 or 14, wherein the receptor is a progesterone 
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receptor and the steroid is progesterone or a progesterone derivative. 

16. The use of claim 15, wherein the progesterone is natural 
micronized progesterone solubilized in a liphophilic matrix system 

5 and/or the progesterone receptor is hPR-A, hPR-B or comprises the 
nucleotide sequence of 557 to 933 SEQ ID NO: 9. 

17. A nucleic acid construct comprising at least one HRE and a 
transgene, wherein one of said at least one HREs is not functionally 

10 linked to the transgene. 

18. The nucleic acid construct of claim 17, which is as defined in 
claims 3 to 12. 

15 19. A vector comprising the nucleic acid construct of claim 17 or 18. 

20. A transformed cell or transgenic organism comprising the nucleic 
acid construct as defined in claims 17 or 18 or the vector as defined in 
claim 19. 

20 

21. A composition of matter comprising a nucleic acid construct 
comprising at least one HRE and a transgene as defined in claim 17 or 
18 and/or a vector as defined in claim 19, said at least one HRE being 
coupled to a hormone-hormone receptor complex. 

25 

22. The composition of matter of claim 21, wherein the hormone- 
hormone receptor complex is as defined in claims 13 to 16. 

23. The composition of matter of claim 21, wherein the transgene is a 
30 gene encoding a blood clotting factor. 
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24. The composition of matter of claim 21 wherein the blood clotting 
factor is factor IX. 

5 25. The composition of matter of claim 21 wherein the blood clotting 
factor is factor VIII. 

26. A method for preparing the composition of matter as defined in 
claim 21, which method comprises admixing the nucleic acid construct 

io with the hormone receptor and the hormone. 

27. A pharmaceutical composition comprising the nucleic acid 
construct of claim 17 or 18, the vector of claim 19, and/or the 
composition of matter of claim 21 to 25. 

15 

28. The pharmaceutical composition of claim 27, which is suitable for 
gene transfer, preferably for treating hemophilia. 

29. A method for gene transfer which comprises administering the 
20 agent as defined in claims 1 to 16, or the composition of matter as 

defined in claims 21 to 25 to an organism or to a cellular system. 

30. A method for delivering into an organism or into a cellular system 
a nucleic acid encoding a transgene to be expressed in the cells of the 

25 organism or the cells of the cellular system, which method comprises 
administering an agent as defined in claims 1 to 16 or composition of 
matter as defined in claims 21 to 25 to the organism or to the cellular 
system so that the hormone in the composition interacts with the cell 
membrane and therewith enhances diffusion and transport of the 
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nucleic acid that is coupled to the hormone-hormone receptor 
complex across the membrane and into the cell. 

31. The method of claim 30, wherein a nucleic acid encoding human 
5 factor VIII or factor IX is delivered into the cell. 

32. A method of treating blood clotting disorders comprising 
administering a therapeutically effective amount of the composition of 
matter of claim 23 to an organism or to a cellular system. 

io 

33. A method of treating hemophilia B, comprising administering a 
therapeutically effective amount of the composition of matter of claim 

24 to an organism or to a cellular system. 

15 34. A method of treating hemophilia A, comprising administering a 
therapeutically effective amount of the composition of matter of claim 

25 to an organism or to a cellular system. 

35. Use of a steroid hormone for preparing an agent for gene transfer. 

20 

36. The use of claim 35, wherein the steroid hormone is a natural 
micronized steroid hormone, preferably natural micronized 
progesterone. 

25 37. The use of claim 36, wherein the natural micronized steroid 
hormone is solubilized in a lipophilic matrix system. 

38. A method for gene transfer which comprises administering a 
nucleic acid construct to an organism or to a cellular system, wherein 
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the nucleic acid construct contains a transgene and is encapsulated in 
a steroid hormone. 
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Fig. 2 




(MSDOCID- <WO 0049147A1_I_> 



WO 00/49147 



3/22 



PCT/EPOO/01368 



Fig. 3 
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Fig. 4 
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Fig. 5 
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Fig. 6 
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Fig. 7 
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CGCGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTC 

CGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTA 

TGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAG 

TACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAG 

TACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTG 

GCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTT 

TGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGT 

GTACGGTGGGAGGTCTATATAAGCAGAGCTCTCTGGCTAACTAGAGAACCCACTGCTTACTGGCTTATCGAAATTAATAC 

GACTCACTATAGGGAGACCCAAGCTTGCATGCCAATTCCGCAAAGGTTATGCAGCGCGTGAACATGATCATGGCAGAATC 

ACCAGGCCTCATCACCATCTGCCTTTTAGGATATCTACTCAGTGCTGAATGTACAGTTTTTCTTGATCATGAAAACGCCA 

ACAAAATTCTGAATCGGCCAAAGAGGTATAATTCAGGTAAATTGGAAGAGTTTGTTCAAGGGAACCTTGAGAGAGAATGT 

ATGGAAGAAAAGTGTAGTTTTG7UVGAAGCACGAGAAGTTTTTGAAAACACTGAAAGAACAACTGAATTTTGGAAGCAGTA 

TGTTGATGGAGATCAGTGTGAGTCCAATCCATGTTTAAATGGCGGCAGTTGCAAGGATGACATTAATTCCTATGAATGTT 

GGTGTCCCTTTGGATTTGAAGGA7UVGAACTGTGAATTAGATGTAACATGTAACATTAAGAATGGCAGATGCGAGCAGTTT 

TGTAAAAATAGTGCTGATAACAAGGTGGTTTGCTCCTGTACTGAGGGATATCGACTTGCAGAAAACCAGAAGTCCTGTGA 

ACCAGCAGTGCCATTTCCATGTGGAAGAGTTTCTGTTTCACAAACTTCTAAGCTCACCCGTGCTGAGACTGTTTTTCCTG 

ATGTGGACTATGTAAATTCTACTGAAGCTGAAACCATTTTGGATAACATCACTCAAAGCACfcCAATCATTTAATGACTTC 

ACTCGGGTTGTTGGTGGAGAAGATGCCAAACCAGGTCAATTCCCTTGGCAGGTTGTTTTGAATGGTAAAGTTGATGCATT 

CTGTGGAGGCTCTATCGTTAATGAAAAATGGATTGTAACTGCTGCCCACTGTGTTGAAACTGGTGTTAAAATTACAGTTG 

TCGCAGGTGAACATAATATTGAGGAGACAGAACATACAGAGCAAAAGCGAAATGTGATTCGAATTATTCCTCACCACAAC 

TACAATGCAGCTATTAATAAGTACAACCATGACATTGCCCTTCTGGAACTGGACGAACCCTTAGTGCTAAACAGCTACGT 

TACACCTATTTGCATTGCTGACAAGGAATACACGAACATCTTCCTCAAATTTGGATCTGGCTATGTAAGTGGCTGGGGAA 

GAGTCTTCCACAAAGGGA.GATCAGCTTTAGTTCTTCAGTACCTTAGAGTTCCACTTGTTGACCGAGCCACATGTCTTCGA 

TCTACAAAGTTCACCATCTATAACAACATGTTCTGTGCTGGCTTCCATGAAGGAGGTAGAGATTCATGTCAAGGAGATAG 

TGGGGGACCCCATGTTACTGAAGTGGAAGGGACCAGTTTCTTAACTGGAATTATTAGCTGGGGTGAAGAGTGTGCAATGA 

AAGGCAAATATGGAATATATACCAAGGTATCCCGGTATGTCAACTGGATTAAGGAAAAAACAAAGCTCACTTAATGGGAT 

CGGTCGAGCGGCCGCGACTCTACTAGAGGATCTTTGTGAAGGAACCTTACTTCTGTGGTGTGACATAATTGGACAAACTA 

CCTACAGAGATTTAAAGCTCTAAGGTAAATATAAAATTTTTAAGTGTATAATGTGTTAAACTACTGATTCTAATTGTTTG 

TGTATTTTAGATTCCAACCTATGGAACTGATGAATGGGAGCAGTGGTGGAATGCCTTTAATGAGGAAAACCTGTTTTGCT 

CAGAAGAAATGCCATCTAGTGATGATGAGGCTACTGCTGACTGTCAACATTCTACTCCTCCAAAAAAGAAGAGAAAGGTA 

GAAGACCCCAAGGACTTTCCTTCAGAATTGCTAAGTTTTTTGAGTCATGCTGTGTTTAGTAATAGAACTCTTGCT.TGCTT 

TGCTATTTACACCACAAAGGAAAAAGCTGCACTGCTATACAAGAAAATTATGGAAAAATATTCTGTAACCTTTATAAGTA 

GGCATAACAGTTATAATCATAACATACTGTTTTTTCTTACTCCACACAGGCATAGAGTGTCTGCTATTAATAACTATGCT 

CAAAAATTGTGTACCTTTAGCTTTTTAATTTGTAAAGGGGTTAATAAGGAATATTTGATGTATAGTGCCTTGACTAGAGA 

TCATAATCAGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTGAACCTGAAACAT 

AAAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTT 

CACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGATCC 

CCGGGTACCCTCTAGAGCGAATTAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAA 

CTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACA 

GTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATAT 

GGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCC 

TGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTC 

ACCGTCATCACCGAAACGCGCGAGACGAAAGGGGGGGTACCAGCTTCGTAGCTAGAACATCATGTTCTGGGATATCAGCT 

TCGTAGCTAGAACATCATGTTCTGGTACCCCCCTCGTGATACGCCTATTTTTATAGGTTAATG.TCATGATAATAATGGTT 

TCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATAT 

GTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCC 

GTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAT^ACGCTGGTGAAAGTAAAAGAT 

GCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCC 

CGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAG 

AGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGAT 

GGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGAC7^ACGAT 

CGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGC 

TGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACT 

GGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCG 

CTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCAC 

TGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGA 

CAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGA 

TTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTG 

AGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATC7VAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATC 

TGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAA 

GGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTTCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACT 

CTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACC 

GGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTT 

GGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGG 
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Fig. 9 (continued) 



CGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCAGGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTT 
TATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAA 
AAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCC 
CTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAG 
TCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTG 
GCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCC 
AGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGAT7VACAATTTCACACAGGAAACAGCTAT 
GACCATGATTACGCCAAGCTCTCTAGAGCTCTAGAGCTCTAGAGCTCTAGAGAGCTTGCATGCCTGCAGGTCG 
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Met Gin Arg Val Asn Met lie Met Ala Glu Ser Pro Gly Leu lie Thr 
1 5 10 ' 15 

lie Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys Thr Val Phe Leu 
20 25 30 

Asp His Glu Asn Ala Asn Lys lie Leu Asn Arg Pro Lys Arg Tyr Asn 
35 40 " 4 5 

Ser Gly Lys Leu Glu Glu Phe Val Gin Gly Asn Leu Glu Arg Glu Cys 
5 0 55 60 

Met Glu Glu Lys Cys Ser Phe Glu Glu Ala Arg Glu Val Phe Glu Asn 
65 70 75 80 

Thr Glu Arg Thr Thr Glu Phe Trp Lys Gin Tyr Val Asp Gly Asp Gin 

85 90 95 

Cys Glu Ser Asn Pro Cys Leu Asn Gly Gly Ser Cys Lys Asp Asp lie 
100 105 ' 110 

Asn Ser Tyr Glu Cys Trp Cys Pro Phe Gly Phe Glu Gly Lys Asn Cys 
115 120 125 

Glu Leu Asp Val Thr Cys Asn lie Lys Asn Gly Arg Cys Glu Gin Phe 
130 135 140 

Cys Lys Asn Ser Ala Asp Asn Lys Val Val Cys Ser Cys Thr Glu Gly 
145 150 155 '[ 160 

Tyr Arg Leu Ala Glu Asn Gin Lys Ser Cys Glu Pro Ala Val Pro Phe 
165 170 175 

Pro Cys Gly Arg Val Ser Val Ser Gin Thr Ser Lys Leu Thr Arg Ala 
180 185 190 

Glu Thr Val Phe Pro Asp Val Asp Tyr Val Asn Ser Thr Glu Ala Glu 
195 200 205 

Thr lie Leu Asp Asn lie Thr Gin Ser Thr Gin Ser Phe Asn Asp Phe 
210 215 220 

Thr Arg Val Val Gly Gly Glu Asp Ala Lys Pro Gly Gin Phe Pro Trp 
225 230 235 240 

Gin Val Val Leu Asn Gly Lys Val Asp Ala Phe Cys Gly Gly Ser lie 
245 250 " 255 

Val Asn Glu Lys Trp lie Val Thr Ala Ala His Cys Val Glu Thr Gly 
260 265 270 

Val Lys lie Thr Val Val Ala Gly Glu His Asn lie Glu Glu Thr Glu 
275 280 285 

His Thr Glu Gin Lys Arg Asn Val lie Arg lie lie Pro His His Asn 
290 295 300 
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Fig. 10 (continued) 

Tyr Asn Ala Ala He Asn Lys Tyr Asn His Asp He Ala Leu Leu Glu 
305 310 315 320 

Leu Asp Glu Pro Leu Val Leu Asn Ser Tyr Val Thr Pro He Cys He 
325 330 335 

Ala Asp Lys Glu Tyr Thr Asn lie Phe Leu Lys Phe Gly Ser Gly Tyr 
340 345 350 

Val Ser Gly Trp Gly Arg Val Phe His Lys Gly Arg Ser Ala Leu Val 
355 360 365 

Leu Gin Tyr Leu Arg Val Pro Leu Val Asp Arg Ala Thr Cys Leu Ara 
370 375 380 y 

Ser Thr Lys Phe Thr He Tyr Asn Asn Met Phe Cys Ala Gly Phe His 
385 390 395 " ~ 400 

Glu Gly Gly Arg Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro His Val 
405 410 415 

Thr Glu Val Glu Gly Thr Ser Phe Leu Thr Gly He He Ser Trp Glv 
420 425 430 

Glu Glu Cys Ala Met Lys Gly Lys Tyr Gly He Tyr Thr Lys Val Ser 
435 440 445 

Arg Tyr Val Ash Trp He Lys Glu Lys Thr Lys Leu Thr 
450 455 4 60 
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Fig. 11 
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Fig. 15 
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Fig. 19 
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i MTELKAKGPR APMVAGGPPS 
61 PRPCQGQDPS DEKTQDQQSL 
121 PSGPGQSQPS PPACEVTSSW 
181 AHKVLPRGLS PARQLLLPAS 
241 PRALGGAAAG GG AAA VP PGA 
301 MDFIHVPILP LNHALLAART 
361 CAYPPDAEPK DDAYPLYSDF 
4 21 PPPPLPPRAT PSRPGEAAVT 
4 81 KAPGASGCLL PRDGLPSTSA 
541 YLNYLRPDSE ASQSPQYSFE 
601 YLCAGRNDCI VDKIRRKNCP 
661 GVPNESQALS QRFTFSPGQD 
721 LGERQLLSVV KWSKSLPGFR 
781 DLILNEQRMK ESSFYSLCLT 
841 FEEMRSSYIR ELIKAIGLRQ 
901 LSVEFPEMMS EVIAAQLPKI 
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PEVGSPLLCR PAAGPFPGSQ 
SDVEGAYSRA EATRGAGGSS 
CLFGPELPED PPAAPATQRV 
ESPHWSGAPV KPSPQAAAVE 
AAGGVALVPK EDSRFSAPRV 
RQLLEDESYD GGAGAASAFA 
QPPALKIKEE EEGAEASARS 
AAPASASVSS ASSSGSTLEC 
SAAAAGAAPA LYPALGLNGL 
SLPQKICLIC GDEASGCHYG 
ACRLRKCCQA GMVLGGRKFK 
IQLIPPLINL LMSIEPDVIY 
NLHIDDQITL IQYSWMSLMV 
MWQIPQEFVK LQVSQEEFLC 
KGVVSSSQRF YQLTKLLDNL 
LAGMVKPLLF HKK 



TSDTLPEVSA IPISLDGLLF 
SSPPEKDSGL LCSVLDTLLA 
LSPLMSRSGC KVGDSSGTAA 
VEEEDGSESE ESAGPLLKGK 
ALVEQDAPMA r GRSPLATTV 
PPRSSPCASS TPVAVGDFPD 
PRS YLVAGAN PAAFPDFPLG 
ILYKAEGAPP QQGPFAPPPC 
PQLGYQAAVL KZGLPQVYPP 
VLTCGSCKVF FKRAMEGQHN 
KFNKVRVVRA LDAVALPQPL 
AGHDNTKPDT SSSLLTSLNO 
FGL'GWRSYKH VSGQMLYFAP 
MKVLLLLNTI FLEGLRSQTQ 
HDLVKQLHLY CLNTFIQSRA 
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1 ctqaccagcg ccgccctccc ccgcccccga cccaggaggL ggaqatccct ccggtccagc 

61 cacattcaac acccactttc tcctccctct gcccctatat tcccgaaacc ccctcctccr. 

121 tcccttttcc ctcctccctg gagacggggg aggagaaaag gggagtccag tcqtc atg ac 

181 tgagctgaag gcaaagggtc cccgggctcc ccacgtggcg ggcggcccgc cctcccccga 

24 1 ggtcggatcc ccactgctgt gtcgcccagc cgcaggtccg ttcccgggga gccagacctc 

301 ggacaccttg cctgaagttt cggccatacc tatctccctg gacgggctac tcttccctcg 

361 gccctgccag ggacaggacc cctccgacga aaagacgcag gaccagcagt cgctgtcgga 

421 cgtggagggc gcatattcca gagctgaagc tacaaggggt gctggaggca gcagttctag 

481 tcccccagaa aaggacagcg gactgctgga cagtgtcttg gacactctgt tggcgccctc 

54 1 aggtcccggg cagagccaac ccagccctcc cgcctgcgag gtcaccagct cttggtgcct 

601 gtttggcccc gaacttcccg aagatccacc ggctgccccc gccacccagc gggtgttgtc 

661 cccgctc atg agccggtccg ggtgcaaggt tggagacagc tccgggacgg cagctgccca 

721 taaagtgctg ccccggggcc tgtcaccagc ccggcagctg ctgctcccgg cctctgagag 

781 ccctcactgg tccggggccc cagtgaagcc gtctccgcag gccgctgcgg tggaggttga 

84 1 ggaggaggat ggctctgagt ccgaggagtc tgcgggtccg cttctgaagg gcaaacctcg 

901 ggctctgggt ggcgcggcgg ctggaggagg aqccgcggct gtcccgccgg gggcggcagc 

961 aggaggcgtc gccctggtcc ccaaggaaga ttcccgcttc tcagcgccca gggtcgccct 

1021 ggtggagcag gacgcgccga tggcgcccgg gcgctccccg ctggccacca cggtgatgga 

1081 tttcatccac gtgcctatcc tgcctctcaa tcacgcctta ttggcagccc gcactcggca 

1141 gctgctggaa gacgaaagtt acgacggcgg ggccggggct gccagcgcct ttgccccgcc 

1201 gcggagttca ccctgtgcct cgtccacccc ggtcgctgta ggcgacttcc ccgactgcgc 

1261 gtacccgccc gacgccgagc ccaaggacga cgcgtaccct ctctatagcg acttccagcc 

1321 gcccgctcta aagataaagg aggaggagga aggcgcggag gcctccgcgc gctccccgcg 

1381 ttcctacctt gtggccggtg ccaaccccgc agccttcccg gatttcccgt tggggccacc 

1441 gcccccgctg ccgccgcgag cgaccccatc cagacccggg gaagcggcgg tgacggccgc 

1501 acccgccagt gcctcagtct cgtctgcgtc ctcctcgggg tcgaccctgg agtgcatcct 

1561 gtacaaagcg gagggcgcgc cgccccagca gggcccgttc gcgccgccgc cctgcaaggc 

1621 gccgggcgcg agcggctgcc tgctcccgcg ggacggcctg ccctccacct ccgcctctgc 

1681 cgccgccgcc ggggcggccc ccgcgctcta ccctgcactc ggcctcaacg ggctcccgca 

1741 gctcggctac caggccgccg tgctcaagga gggcctgccg caggtctacc cgccctatct 

1801 caactacctg aggccggatt cagaagccag ccagagccca caatacagct tcgagtcatt 

1861 acctcagaag atttgtttaa tctgtgggga tgaagcatca ggctgtcatt atggtgtcct 

1921 tacctgtggg agctgtaagg tcttctttaa gagggcaatg gaagggcagc acaactactt 

1981 atgtgctgga agaaatgact gcatcgttga taaaatccgc agaaaaaact gcccagcatg 

2041 tcgccttaga aagtgctgtc aggctggcat ggtccttgga ggtcgaaaat ttaaaaagtt 

2101 caataaagtc agagttgtga gagcactgga tgctgttgct ctcccacagc cattgggcgt 

2161 tccaaatgaa agccaagccc taagccagag attcactttt tcaccaggtc aagacataca 

2221 gttgattcca ccactgatca acctgttaat gagcattgaa ccagatgtga tctatgcagg 

2281 acatgacaac acaaaacctg acacctccag ttctttgctg acaagtctta atcaactagg 

2341 cgagaggcaa cttctttcag tagtcaagtg gtctaaatca ttgccaggtt ttcgaaactt 

2401 acatattgat gaccagataa ctctcattca gtattcttgg atgagcttaa tggtgtttgg 

24 61 tctaggatgg agatcctaca aacatgtcag tgggcagatg ctgtattttg cacctgatct 

2521 aatactaaat gaacagcgga tgaaagaatc atcattctat tcattatgcc ttaccatgtg 

2581 gcagatccca caggagtttg tcaagcttca agttagccaa gaagagttcc tctgtatgaa 

2641 agtattgtta cttcttaata caattccttt ggaagggcta cgaagtcaaa cccagtttga 

2701 ggagatgagg tcaagctaca ttagagagct catcaaggca attggtttga ggcaaaaagg 

2761 agttgtgtcg agctcacagc gtttctatca acttacaaaa ettcttgata acttgcatga 

2821 tcttgtcaaa caacttcatc tgtactgctt gaatacattt atccagtccc gggcactgag 

2881 tgttgaattt ccagaaatga tgtctgaagt tattgctgca caattaccca agatattggc 

2941 agggatggtg aaaccccttc tctttcataa 



Fig. 21 



/ 



WO 00/49147 



PCT/EPOO/01368 



SEQUENCE LISTING 
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<130> 000065wo/JH/ml 

10 <140> 
<141> 

<160> 19 

15 <170> Patentln Ver. 2.1 



<210> 1 

<211> 5753 

20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector pTGFG36 

<220> 

<221> CDS 

<222> (689) . . (2071) 



25 



30 <400> 1 





cgcgttgaca 


ttgattattg 


actagttatt 


aatagtaatc 


aattacgggg 


tcattagttc 


60 




atagcccata 


tatggagttc 


cgcgttacat 


aacttacggt 


aaatggcccg 


cctggctgac 


120 


35 


cgcccaacga 


cccccgccca 


ttgacgtcaa 


taatgacgta 


tgttcccata 


gtaacgccaa 


180 




tagggacttt 


ccattgacgt 


caatgggtgg 


agtatttacg 


gtaaactgcc 


cacttggcag 


240 


40 


tacatcaagt 


gtatcatatg 


ccaagtacgc 


cccctattga 


cgtcaatgac 


ggtaaatggc 


300 


ccgcctggca 


ttatgcccag 


tacatgacct 


tatgggactt 


tcctacttgg 


cagtacatct 


360 




acgtattagt 


catcgctatt 


accatggtga 


tgcggttttg 


gcagtacatc 


aatgggcgtg 


420 


45 


gatagcggtt 


tgactcacgg 


ggatttccaa 


gtctccaccc 


cattgacgtc 


aatgggagtt 


480 




tgttttggca 


ccaaaatcaa 


cgggactttc 


caaaatgtcg 


taacaactcc 


gccccattga 


540 


50 


cgcaaatggg 


cggtaggcgt 


gtacggtggg 


aggtctatat 


aagcagagct 


ctctggctaa 


600 


ctagagaacc 


cactgcttac 


tggcttatcg 


aaattaatac 


gactcactat 


agggagaccc 


660 


55 


aagcttgcat 


gccaattccg 


caaaggtt atg cag cgc gtg aac atg 
Met Gin Arg Val Asn Met 
1 5 


ate atg 
lie Met 


712 



60 



gca gaa tea cca ggc etc ate acc ate tgc ctt tta gga tat eta etc 760 
Ala Glu Ser Pro Gly Leu lie Thr lie Cys Leu Leu Gly Tyr Leu Leu 
10 15 20 
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856 



904 



952 



1000 



1048 



1096 
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get gec cac tgt gtt gaa act ggt gtt aaa att aca gtt gtc gca ggt 1528 
Ala Ala His Cys Val Glu Thr Gly Val Lys He Thr Val Val Ala Gly 
265 270 275 280 

5 gaa cat aat att gag gag aca gaa cat aca gag caa aag cga aatgtg 1576 . 
Glu His Asn He Glu Glu Thr Glu His Thr Glu Gin Lys Arg Asn Val 
285 290 295 

att cga att att cct cac cac aac tac aat gca get att aat aag tac 1624 
10 He Arg lie lie Pro His His Asn Tyr Asn Ala Ala He Asn Lys Tyr 
300 305 310 

aac cat gac att gec ctt ctg gaa ctg gac gaa ccc tta gtg eta aac 1672 
Asn His Asp lie Ala Leu Leu Glu Leu Asp Glu Pro Leu Val Leu Asn 
15 315 320 325 

age tac gtt aca cct att tgc att get gac aag gaa tac acg aac ate 1720 

Ser Tyr Val Thr Pro He Cys He Ala Asp Lys Glu Tyr Thr Asn He 

330 335 340 

20 

ttc etc aaa ttt gga tct ggc tat gta agt ggc tgg gga aga gtc ttc 1768 

Phe Leu Lys Phe Gly Ser Gly Tyr Val Ser Gly Trp Gly Arg Val Phe 

345 350 355 " 360 

25 cac aaa ggg aga tea get tta gtt ctt cag tac ctt aga gtt cca ctt 1816 
His Lys Gly Arg Ser Ala Leu Val Leu Gin Tyr Leu Arg Val Pro Leu 
365 - 370 375 

gtt gac cga gee aca tgt ctt cga tct aca aag ttc ace ate tat aac 18 64 
30 Val Asp Arg Ala Thr Cys Leu Arg Ser Thr Lys Phe Thr He Tyr Asn 
380 385 390 

aac atg ttc tgt get ggc ttc cat gaa gga ggt aga gat tea tgt caa 1912 
Asn Met Phe Cys Ala Gly Phe His Glu Gly Gly Arg Asp Ser Cys Gin 
35 395 ~ 400 405 

gga gat agt ggg gga ccc cat gtt act gaa gtg gaa ggg ace agt ttc 1960 

Gly Asp Ser Gly Gly Pro His Val Thr Glu Val Glu Gly Thr Ser Phe 

410 415 420 

40 

tta act gga att att age tgg ggt gaa gag tgt gca atg aaa ggc aaa 2008 

Leu Thr Gly He He Ser Trp Gly Glu Glu Cys Ala Met Lys Gly Lys 

425 430 435 440 

45 tat gga ata tat ace aag gta tec egg tat gtc aac tgg att aag gaa 2056 
Tyr Gly lie Tyr Thr Lys Val Ser Arg Tyr Val Asn Trp He Lys Glu 
445 450 . 455 

aaa aca aag etc act taatgggatc ggtcgagegg ccgcgactct actagaggat 2111 
50 Lys Thr Lys Leu Thr 
4 60 

ctttgtgaag gaaccttact tctgtggtgt gacataattg gacaaactac ctacagagat 2171 
55 ttaaagctct aaggtaaata taaaattttt aagtgtataa tgtgttaaac tactgattct 2231 
aattgtttgt gtattttaga ttccaaccta tggaactgat gaatgggagc agtggtggaa 2291 
tgcctttaat gaggaaaacc tgttttgctc agaagaaatg ccatctagtg atgatgaggc 2351 

60 

tactgetgae tctcaacatt ctactcctcc aaaaaagaag agaaaggtag aagaccccaa 2411 
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•iggactttcct 


t cagaattgc 


taagtttttt 


aaatcatrrrt" 

y a y i_ l, a Ly^,L 


y Lyi. l Lay Ld 


d LdgadCLCC 


Z 4 / 1 




tgcttgcttt 


gcta tttaca 


ccacaaagga 


C4 d U CJ1 ^-4 L- I — V_J v*^ CI 


L.i_yL- La LaLd 


cigclclclciLLaL 




5 


^ggaaaaatat 


tctgtaacct 


ttataaataa 




L d LdaLLdLa 


acaLaeLgtt 






ttttcttact 


ccacacaggc 


at agagtgtc 


tactattaaf 


aau La i.y u LL 


aciciclclLL.gL.g 


ZDJl 


10 


tacctttagc 


tttttaattt 


qt aaaacraat 

3 »-«-»«-»-«-*yyyy >- 


taataacrcra^ 


faff f rr =» f rr t" 


dLdgLyCCL L 


Z / J. JL 


-gactagagat 


cataatcagc 


cataccacat 


ttataaaanf 


LL LCLL.L L y L L 


L LdddaaaCC 


9*7 "7 1 




-tcccacacct 


ccccctgaac 


ctgaaacat a 


aaataaatnr 


aatty l ty L L 


yLLaaCLLyt 


zo ji 


15 


ttatt gcagc 


tta taatggt 


t acaaataaa 


acaataorat" 

y v-r u u u a. y L 


lg Laaa L L LL 


duddd Ldddg 






.catttttttc 


actgcattct 


aottataatt 

3 w 3 ^ 3 3 *" 


tgtccaaact 


a v*- ^ — . a a l y l a. 


LLLLoLLdLy 




20 


t ctggat ccc 


cgggtaccct 


ctagagcgaa 


tt aattcact 


yyuL.y L i — y L L 


L LdLddLy L<_ 




qtqactqqqa 


aaaccct ggc 


gttacccaac 


ttaat cgcct 


tocaacarai" 

»— y ^ C* y \_C3L V* CL l_ 


LLLLLL L LL.LJ 


jU ' 1 




ccaqctqqcq 


taa tagcgaa 


gaggcccgca 


ccoatcoccc 

^* ^4 **** v_. 


L LL.OUuuL.ay 


l LLjcgLdgcc 


Jl Jl 


25 


tqaatqqcqa 


atqqcqcctq 


atgcggtatt 


ttct cct t ac 


y wC* l— w l— l_ y L— 


y y L ct L L LLdL 


J i -7 i 




accgcatatg 


gtgcactctc 


agtacaat ct 


gctctgatgc 


cocatafif t*a 

v — - y ^ — - u L CL y t L- CX 


ca y LLayLLLL 


Ji 


30 


.gacacccgcc 


aacacccgct 


gacgcgccct 


gacgggctt g 


t. r.fnrt" cccc% 

l y v_ i_ L- v_. y 


y Ld LLLy L L L 


J Jll 


acagacaagc 


tgtgaccgtc 


tccqqqaact 


gcatgtgt ca 


3 3 3 L L I— LLa 


v — L, y LLa LLdL. 


^"3*7 1 




cgaaacgcgc 


qaqacqaaaq 


qqqqqqtacc 


agcttcgtag 


ctagaacatc 


a Ly l ll '-yyy 


—/ ^ — ' -L 


35 


atatcagctt 


cgtagctaga 


acatcatgtt 


ctggt acccc 


cct cert cm t a. 

l^ v — l- y i_ y a i_ a 


cctcci~at"t~1-1~ 

L,y LL L a L L L L 


J *i -7 -L 




tataggttaa 


tgt cat gat a 


at aatggttt 


cttaaacat c 


oyy l y y l a l l 


L L LLy y y y aa 


^ R R 1 
j j j J. 


40 


atqtqcqcqq 


aacccctatt 


tgtttatttt 


tctaaataca 


ttcaaatata 


LCI L L L y L LLCt 


«J L> J. X 


tgagacaata 


accctgat aa 


atgcttcaat 


aatattgaaa 


aaooaaoarrt 

u ca y y ci a y a y k. 


a Ly ay L a. L ll 


"3^7 1 

JV / i 




aacatttccg 


tgtcgccctt 


attccctttt 


ttacaacatt 


ttcrrT*tt cct 

1- l- y \_- L- U L LL L 


yLLLLLyLLL 


j / 


45 


acccagaaac 


qctqqtqaaa 

3 3 3 3 


gtaaaagatg 


ctgaaga t ca 


att Oct nt" ocpi 

y l y y y v— y v_ ci 


Ly ay l y y y l L 


17 Ql 




acatcgaact 


ggatctcaac 


aqcqqtaaqa 


tcct tgagag 


ttttcgcccc 


y ca a y a a. l. y l l. 


38 51 


50 


ttccaatgat 


gagcactttt 


aaagtt ctgc 


tatgtggcgc 


aatattatrr 

y y lu l. l a. i_ \^ v— 


Ly La l Lyauy 




ccgggcaaga 


gcaactcggt 


cgccgcatac 


actattctca 


gaatgacttg 


gttgagtact 


3971 




caccagtcac 


agaaaagcat 


cttacggatg 


gcatgacagt 


aagagaatta 


tgcagtgctg 


4031 


55 


ccataaccat 


gagtgataac 


actgcggcca 


acttacttct 


gacaacgatc 


ggaggaccga 


4091 




aggagctaac 


cgcttttttg 


cacaacatgg 


gggatcatgt 


aactcgcctt 


gatcgttggg 


4151 


60 


aaccggagct 


gaatgaagcc 


ataccaaacg 


acgagcgtga 


caccacgatg 


cctgtagcaa 


4211 


tggcaacaac 


gttgcgcaaa 


ctattaactg 


gcgaactact 


tactctagct 


tcccggcaac 


4271 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 4 331 
cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 4 39! 
ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 4451 
gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 4511 
agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 4571 
atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 4 631 
cttaacgtga gttttcgttc cactgagegt cagaccccgt agaaaagatc aaaggatctt 4 691 
cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 4 751 
cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 4811 
tcagcagagc gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact 4871 
tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 4 931 
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 4 991 
aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 5051 
cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag 5111 
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 5171 
agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac. 5231 
ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 5291 
acgcggcctt tttacggttc ctggcctttt gctggccttt €qctcacatg ttctttcctg 5351 
cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 5411 
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 5471 
tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 5531 
ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 5591 
-aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 5651 
gataacaatt tcacacagga aacagctatg accatgatta cgccaagctc tctagagctc 5711 
tagagctcta gagctctaga gagcttgcat gcctgcaggt eg 5753 

<210> 2 
<211> 461 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: vector pTGFG36 
<400> 2 

Met Gin Arg Val Asn Met lie Met Ala Glu Ser Pro Gly Leu He Thr 
1 5 10 



15 
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He Cys Leo Leu Gly Tyr Leu Leu Ser Ala Glu Cys Thr Val Phe Leu 
20 25 30 

Asp His Glu Asn Ala Asn Lys He Leu Asn Arg Pro Lys Arg Tyr Asn 

40 45 

Ser Gly Lys Leu Glu Glu Phe Val Gin Gly Asn Leu Glu Arg Glu Cys 
5U 55 60 

10 Met Glu Glu Lys Cys Ser Phe Glu Glu Ala Arg Glu Val Phe Glu Asn 

70 75 80 

Thr Glu Arg Thr Thr Glu Phe Trp Lys Gin Tyr Val Asp Gly Asp Gin 
15 85 90 95 

Cys Glu Ser Asn Pro Cys Leu Asn Gly Gly Ser Cys Lys Asp Asp He 
100 105 110 

Asn Ser Tyr Glu Cys Trp Cys Pro Phe Gly Phe Glu Gly Lys Asn Cys 
- LJ - D 120 12 5 

Glu Leu Asp Val Thr Cys Asn He Lys Asn Gly Arg Cys Glu Gin Phe 
iJU 135 140 

25 Cys Lys Asn Ser Ala Asp Asn Lys Val Val Cys Ser Cys Thr Glu Glv 
145 150 155 ' i 6 o 

Tyr Arg Leu Ala Glu Ash Gin Lys Ser Cys Glu Pro Ala Val Pro Phe 
30 165 170 175 

Pro Cys Gly Arg Val Ser Val Ser Gin Thr Ser Lys Leu Thr Arg Ala 
180 185 i9 0 

Glu Thr Val Phe Pro Asp Val Asp Tyr Val Asn Ser Thr Glu Ala Glu 
195 .200 205 

Thr lie Leu Asp Asn He Thr Gin Ser Thr Gin Ser Phe Asn Asp Phe 
.c 1 u 215 



220 



40 Thr Arg Val Val Gly Gly Glu Asp Ala Lys Pro Gly Gin Phe Pro Trp 
225 230 235 240 

Gin Val Val Leu Asn Gly Lys Val Asp Ala Phe Cys Gly Gly Ser He 
45 245 2 50 255 

Val Asn' Glu Lys Trp lie Val Thr Ala Ala His Cys Val Glu Thr Glv 
260 265 270 

Val Lys lie Thr Val Val Ala Gly Glu His Asn He Glu Glu Thr Glu 
50 2? 5 280 285 

His Thr Glu Gin Lys Arg Asn Val He Arg He He Pro His His Asn 
290 295 



300 



55 Tyr Asn Ala Ala He Asn Lys Tyr Asn His Asp He Ala Leu Leu Glu 

J05 310 315 - 320 

Leu Asp Glu Pro Leu Val Leu Asn Ser Tyr Val Thr Pro He Cys He 

60 325 330 335 

Ala Asp Lys Glu Tyr Thr Asn He Phe Leu Lys Phe Gly Ser Glv Tvr 

340 345 350 Y 
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Val Ser Gly Trp Gly Arg Val Phe His Lys Gly Arg Ser Ala Leu Val 

355 360 365 

Leu Gin Tyr Leu Arg Val Pro Leu Val Asp Arg Ala Thr Cys Leu Arg 

370 375 " 380 

Ser Thr Lys Phe Thr lie Tyr Asn Asn Met Phe Cys Ala Gly Phe His 

385 390 395 ' 400 

Glu Gly Gly Arg Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro His Val 
405 410 " 415 



Thr Glu Val Glu Gly Thr Ser Phe Leu Thr Gly lie lie Ser Trp Gly 
15 420 425 430 

Glu Glu Cys Ala Met Lys Gly Lys Tyr Gly He Tyr Thr Lys Val Ser 
435 440 445 

20 Arg Tyr Val Asn Trp He Lys Glu Lys Thr Lys Leu Thr 
450 455 " 460 

<210> 3 

25 <211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 3 

30 ggggtaccag cttcgtagct agaacatcat gttctgggat atcagcttcg tagctagaac 60 
atcatgttct ggtacccc 78 

<210> 4 

35 <211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 4 

40 ggggtaccag aacatgatgt tctagctacg aagctgatat cccagaacat gatgttctag 60 
ctacgaagct ggtacccc ~ 7 8 

<210> 5 

45 <211> 19 . 

<212> DNA 

<213> Homo sapiens 

<400> 5 

50 agcttgacct cgagcaagc 19 

<210> 6 
<211> 19 
55 <212> DNA 

<213> Homo sapiens 



60 



<400> 6 

ggccgcttgc tcgaggtca 19 
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<210> 7 

<211> 43 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggaattccgc aaaggttatg cagcgcgtga acatgatcat ggc 

<210> 8 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cgcggatcca ttaagtgagc tttgtttttt ccttaatcc 

<210> 9 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 9 

cgaggatcca gtcgtcatga ctgagc 



43 



39 



26 



<210> 10 

<211> 41 

30 <212> DNA 

<213> Homo sapiens 

<400> 10 

gcagaattca ttataaaaac tcaagacctc ataatcctga c 



41 



40 



45 



50 



<210> 11 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 11 

ctcctcgggg tcgaccctgg 

<210> 12 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ccagggtcga ccccgaggag 



20 



20 



55 <210> 13 

<211> 5905 

<212> DNA 

<213> Artificial Sequence 
60 <220> 



<223> Description of Artificial Sequence: vector pTGFG53 
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<400> 13 

cgcgttgaca ttgattattg 
atagcccata tatggagttc 
cgcccaacga cccccgccca 
5 tagggacttt ccattgacgt 
tacatcaagt gtatcatatg 
ccgcctggca ttatgcccag 
acgtattagt catcgctatt 
gatagcggtt tgactcacgg 

10 tgttttggca ccaaaatcaa 
cgcaaatggg cggtaggcgt 
ctagagaacc cactgcttac 
aagcttgcat gccaattccg 
accaggcctc atcaccatct 

1 5 tcttgatcat gaaaacgcca 
attggaagag tttgttcaag 
tgaagaagca cgagaagttt 
tgttgatgga gatcagtgtg 
cattaattcc tatgaatgtt 

20 tgtaacatgt aacattaaga 
caaggtggtt tgctcctgta 
accagcagtg ccatttccat 
tgctgagact gtttttcctg 
ggataacatc actcaaagca 

25 agatgccaaa ccaggtcaat 
ctgtggaggc tctatcgtta 
tggtgttaaa attacagttg 
gcaaaagcga aatgtgattc 
gtacaaccat gacattgccc 

30 tacacctatt tgcattgctg 
ctatgtaagt ggctggggaa 
ccttagagtt ccacttgttg 
taacaacatg ttctgtgctg 
tgggggaccc catgttactg 

35 gggtgaagag tgtgcaatga 
caactggatt aaggaaaaaa 
tactagagga tctttgtgaa 
cctacagaga tttaaagctc 
ctactgattc taattgtttg 

40 cagtggtgga atgcctttaa 
gatgatgagg ctactgctga 
gaagacccca aggactttcc 
aatagaactc ttgcttgctt 
aagaaaatta tggaaaaata 

45 aacatactgt tttttcttac 
caaaaattgt gtacctttag 
tatagtgcct tgactagaga 
tttaaaaaac ctcccacacc 
tgttaacttg tttattgcag 

50 cacaaataaa gcattttttt 
atcttatcat gtctggatcc 
ggatatcagc ttcgtagcta 
ttcactggcc gtcgttttac 
tcgccttgca gcacatcccc 

55 tcgcccttcc caacagttgc 
ccttacgcat ctgtgcggta 
tgatgccgca tagttaagcc 
ggcttgtctg ctcccggcat 
gtgtcagagg ttttcaccgt 

60 acatgatgtt ctagctacga 
gtaccccggc ctcgtgatac 
ttagacgtca ggtggcactt 



actagttatt aatagtaatc 
cgcgttacat aacttacggt 
ttgacgtcaa taatgacgta 
caatgggtgg agtatttacg 
ccaagtacgc cccctattga 
tacatgacct tatgggactt 
accatggtga tgcggttttg 
ggatttccaa gtctccaccc 
cgggactttc caaaatgtcg 
gtacggtggg aggtctatat 
tggcttatcg aaattaatac 
caaaggttat gcagcgcgtg 
gccttttagg atatctactc 
acaaaattct gaatcggcca 
ggaaccttga gagagaatgt 
ttgaaaacac tgaaagaaca 
agtccaatcc atgtttaaat 
ggtgtccctt tggatttgaa 
atggcagatg cgagcagttt 
ctgagggata tcgacttgca 
gtggaagagt ttctgtttca 
atgtggacta tgtaaattct 
cccaatcatt taatgacttc 
tcccttggca ggttgttttg 
atgaaaaatg gattgtaact 
tcgcaggtga acataatatt 
gaattattcc tcaccacaac 
ttctggaact ggacgaaccc 
acaaggaata cacgaacatc 
gagtcttcca caaagggaga 
accgagccac atgtcttcga 
gcttccatga aggaggtaga 
aagtggaagg gaccagtttc 
aaggcaaata tggaatatat 
caaagctcac ttaatgggat 
ggaaccttac ttctgtggtg 
taaggtaaat ataaaatttt 
tgtattttag attccaacct 
tgaggaaaac ctgttttgct 
ctctcaacat tctactcctc 
ttcagaattg ctaagttttt 
tgctatttac accacaaagg 
ttctgtaacc tttataagta 
tccacacagg catagagtgt 
ctttttaatt tgtaaagggg 
tcataatcag ccataccaca 
tccccctgaa cctgaaacat 
cttataatgg ttacaaataa 
cactgcattc tagttgtggt 
ccggggggta ccagcttcgt 
gaacatcatg ttctggtacc 
aacgtcgtga ctgggaaaac 
ctttcgccag ctggcgtaat 
gcagcctgaa tggcgaatgg 
tttcacaccg catatggtgc 
agccccgaca cccgccaaca 
ccgcttacag acaagctgtg 
catcaccgaa acgcgcgaga 
agctgatatc ccagaacatg 
gcctattttt ataggttaat 
ttcggggaaa tgtgcgcgga 



aattacgggg tcattagttc 60 
aaatggcccg cctggctgac 120 
tgttcccata gtaacgccaa 180 
gtaaactgcc cacttggcag 240 
cgtcaatgac ggtaaatggc 300 
tcctacttgg cagtacatct 360 
gcagtacatc aatgggcgtg 420 
cattgacgtc aatgggagtt 480 
taacaactcc gccccattga 540 
aagcagagct ctctggctaa 600 
gactcactat agggagaccc 660 
aacatgatca tggcagaatc 720 
agtgctgaat gtacagtttt 780 
aagaggtata attcaggtaa 840 
atggaagaaa agtgtagttt 900 
actgaatttt ggaagcagta 960 
ggcggcagtt gcaaggatga 1020 
ggaaagaact gtgaattaga 1080 
tgtaaaaata gtgctgataa 1140 
gaaaaccaga agtcctgtga 1200 
caaacttcta agctcacccg 1260 
actgaagctg aaaccatttt 1320 
actcgggttg ttggtggaga 1380 
aatggtaaag ttgatgcatt 1440 
gctgcccact gtgttgaaac 1500 
gaggagacag aacatacaga 1560 
tacaatgcag ctattaataa 1620 
ttagtgctaa acagctacgt 1680 
ttcctcaaat ttggatctgg 1-7-40 
tcagctttag ttcttcagta 1800 
tctacaaagt tcaccatcta 1860 
gattcatgtc aaggagatag 1920 
ttaactggaa ttattagctg 1980 
accaaggtat cccggtatgt 2040 
cggtcgagcg gccgcgactc 2100 
tgacataatt ggacaaacta 2160 
taagtgtata atgtgttaaa 2220 
atggaactga tgaatgggag 2280 
cagaagaaat gccatctagt 2340 
caaaaaagaa gagaaaggta 2400 
tgagtcatgc tgtgtttagt 24 60 
aaaaagctgc actgctatac 2520 
ggcataacag ttataatcat 2580 
ctgctattaa taactatgct 2640 
ttaataagga atatttgatg 2700 
tttgtagagg ttttacttgc 2760 
aaaatgaatg caattgttgt 2820 
agcaatagca tcacaaattt 2880 
ttgtccaaac tcatcaatgt 2940 
agctagaaca tcatgttctg 3000 
cccgctctag agcgaattaa 3060 
cctggcgtta cccaacttaa 3120 
agcgaagagg cccgcaccga 3180 
cgcctgatgc ggtattttct 3240 
actctcagta caatctgctc 3300 
cccgctgacg cgccctgacg 3360 
accgtctccg ggagctgcat 3420 
cgaaagggcg gggtaccaga 34 80 
atgttctagc tacgaagctg 3540 
gtcatgataa taatggtttc 3600 
acccctattt gtttattttt 3660 
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ctaaatacat 

atattgaaaa 

tgcggcattt 

tgaagatcag 

cct tgagagt 

atgtggcgcg 

ctattctcag 

catgacagta 

cttacttctg 

ggatcatgta 

cgagcgtgac 

cgaactactt 

tgcaggacca 

agccggtgag 

ccgtatcgta 

gatcgctgag 

atatatactt 

cctttttgat 

agaccccgta 

ctgcttgcaa 

accaactctt 

tctagtgtag 

cgctctgcta 

gttggactca 

tgcacacagc 

ctatgagaaa 

agggtcggaa 

agtcctgtcg 

gggcggagcc 

tggccttttg 

accgcctttg 

gtgagcgagg 

attcattaat 

gcaattaatg 

gctcgtatgt 

catgattacg 

cggggtacca 

catcatgttc 



tcaaatatgt 

aggaagagta 

tgccttcctg 

ttgggtgcac 

tttcgccccg 

gtattatccc 

aatgacttgg 

agagaattat 

acaacgatcg 

actcgccttg 

accacgatgc 

actctagctt 

cttctgcgct 

cgtgggtctc 

gttatctaca 

ataggtgcct 

tagattgatt 

aatctcatga 

gaaaagatca 

acaaaaaaac 

tttccgaagg 

ccgtagttag 

atcctgttac 

agacgatagt 

ccagcttgga 

gcgccacgct 

caggagagcg 

ggtttcgcca 

tatggaaaaa 

ctcacatgtt 

agtgagctga 

aagcggaaga 

gcagctggca 

tgagttagct 

tgtgtggaat 

ccaagctctc 

gcttcgtagc 

tggtaccccg 



atccgctcat 

tgagtattca 

tttttgctca 

gagtgggtta 

aagaacgttt 

gtattgacgc 

ttgagtactc 

gcagtgctgc 

gaggaccgaa 

atcgttggga 

ctgtagcaat 

cccggcaaca 

cggcccttcc 

gcggtatcat 

cgacggggag 

cactgattaa 

taaaacttca 

ccaaaatccc 

aaggatcttc 

caccgctacc 

taactggctt 

gccaccactt 

cagtggctgc 

tacggataag 

gcgaacgacc 

tcccgaaggg 

cacgagggag 

cctctgactt 

cgccagcaac 

ctttcctgcg 

taccgctcgc 

gcgcccaata 

cgacaggttt 

cactcattag 

tgtgagcgga 

tagagctcta 

tagaacatca 

gtcga 



gagacaataa 

acatttccgt 

cccagaaacg 

catcgaactg 

tccaatgatg 

cgggcaagag 

accagtcaca 

cataaccatg 

ggagctaacc 

accggagctg 

ggcaacaacg 

attaatagac 

ggctggctgg 

tgcagcactg 

tcaggcaact 

gcattggtaa 

tttttaattt 

ttaacgtgag 

ttgagatcct 

agcggtggtt 

cagcagagcg 

caagaactct 

tgccagtggc 

gcgcagcggt 

tacaccgaac 

agaaaggcgg 

ctt ccagggg 

gagcgtcgat 

gcggcctttt 

ttatcccctg 

cgcagccgaa 

cgcaaaccgc 

cccgactgga 

gcaccccagg 

taacaatttc 

gagctctaga 

tgttctggga 



ccctgataaa 

gtcgccctta 

ctggtgaaag 

gatctcaaca 

agcactttta 

caactcggtc 

gaaaagcatc 

agtgataaca 

gcttttttgc 

aatgaagcca 

ttgcgcaaac 

tggatggagg 

tttattgctg 

gggccagatg 

atggatgaac 

ctgtcagacc 

aaaaggatct 

ttttcgttcc 

ttttttctgc 

tgtttgccgg 

cagataccaa 

gtagcaccgc 

gataagtcgt 

cgggctgaac 

tgagatacct 

acaggtatcc 

gaaacgcctg 

ttttgtgatg 

tacggttcct 

attctgtgga 

cgaccgagcg 

ctctccccgc 

aagcgggcag 

ctttacactt 

acacaggaaa 

gctctagaga 

tatcagcttc 



tgcttcaata 

ttcccttttt 

taaaagatgc 

gcggtaagat 

aagttctgct 

gccgcataca 

ttacggatgg 

ctgcggccaa 

acaacatggg 

taccaaacga 

tattaactgg 

cggataaagt 

ataaatctgg 

gtaagccctc 

gaaatagaca 

aagtttactc 

aggtgaagat 

actgagcgtc 

gcgtaatctg 

atcaagagct 

atactgtcct 

ctacatacct 

gtcttaccgg 

ggggggttcg 

acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
taaccgtatt 
cagcgagtca 
gcgttggccg 
tgagcgcaac 
tatgcttccg 
cagctatgac 
gcttgcatgc 
gtagctagaa 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4 620 

4680 

4740 

4 8 00 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

52 80 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5905 



<210> 14 
<211> 6052 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



Sequence: vector pTGFG64 



<400> 14 

cgcgttgaca 

atagcccata 

cgcccaacga 

tagggacttt 

tacatcaagt 

ccgcctggca 

acgtattagt 

gatagcggtt 

tgttttggca 

cgcaaatggg 

ctagagaacc 

aagcttgcat 

accaggcctc 



ttgattattg 
tatggagttc 
cccccgccca 
ccattgacgt 
gtatcatatg 
ttatgcccag 
catcgctatt 
tgactcacgg 
ccaaaatcaa 
cggtaggcgt 
cactgcttac 
gccaattccg 
atcaccatct 



actagttatt 
cgcgttacat 
ttgacgtcaa 
caatgggtgg 
ccaagtacgc 
tacatgacct 
accatggtga 
ggatttccaa 
cgggactttc 
gtacggtggg 
tggcttatcg 
caaaggttat 
gccttttagg 



aatagtaatc 
aacttacggt 
taatgacgta 
agtatttacg 
cccctattga 
tatgggactt 
tgcggttttg 
gtctccaccc 
caaaatgtcg 
aggtctatat 
aaattaatac 
gcagcgcgtg 
atatctactc 



aattacgggg 
aaatggcccg 
tgttcccata 
gtaaactgcc 
cgtcaatgac 
tcctacttgg 
gcagtacatc 
cattgacgtc 
taacaactcc 
aagcagagct 
gactcactat 
aacatgatca 
agtgctgaat 



tcattagttc 
cctggctgac 
gtaacgccaa 
cacttggcag 
ggtaaatggc 
cagtacatct 
aatgggcgtg 
aatgggagtt 
gccccattga 
ctctggctaa 
agggagaccc 
tggcagaatc 
gtacagtttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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tcttgatcat 

attggaagag 

tgaagaagca 

tgttgatgga 

cattaattcc 

tgtaacatgt 

caaggtggtt 

accagcagtg 

tgctgagact 

ggataacatc 

agatgccaaa 

ctgtggaggc 

tggtgttaaa 

gcaaaagcga 

gtacaaccat 

tacacctatt 

ctatgtaagt 

ccttagagtt 

taacaacatg 

tqqgggaccc 

gggtgaagag 

caactggatt 

tactagagga 

cctacagaga 

ctactgattc 

cagtggtgga 

gatgatgagg 

gaagacccca 

aatagaactc 

aagaaaatta 

aacatactgt 

caaaaattgt 

tatagtgcct 

tttaaaaaac 

tgttaacttg 

cacaaataaa 

atcttatcat 

ggatatcagc 

tcactggccg 

cgccttgcag 

cgcccttccc 

agaacatcat 

ctgatgcggt 

ctcagtacaa 

gctgacgcgc 

gtctccggga 

aagggcacca 

gctacgaagc 

ataatggttt 

tgtttatttt 

atgcttcaat 

attccctttt 

gtaaaagatg 

agcggtaaga 

aaagttctgc 

cgccgcatac 

cttacggatg 

actgcggcca 

cacaacatgg 

ataccaaacg 

ctattaactg 

gcggataaag 



gaaaacgcca 

tttgttcaag 

cgagaagttt 

gatcagtgtg 

tatgaatgtt 

aacattaaga 

tgctcctgta 

ccatttccat 

gtttttcctg 

actcaaagca 

ccaggtcaat 

tctatcgtta 

attacagttg 

aatgtgattc 

gacattgccc 

tgcattgctg 

ggctggggaa 

ccacttgttg 

ttctgtgctg 

catgttactg 

tgtgcaatga 

aaggaaaaaa 

tctttgtgaa 

tttaaagctc 

taattgtttg 

atgcctttaa 

ctactgctga 

aggactttcc 

ttgcttgctt 

tggaaaaata. 

tttttcttac 

gtacctttag 

tgactagaga 

ctcccacacc 

tttattgcag 

gcattttttt 

gtctggatcc 

ttcgtagcta 

tcgttttaca 

cacatccccc 

aacagttgcg 

gttctgggat 

attttctcct * 

tctgctctga 

cctgacgggc 

gctgcatgtg 

gaacatgatg 

tggtaccccg 

cttagacgtc 

tctaaataca 

aatattgaaa 

ttgcggcatt 

ctgaagatca 

tccttgagag 

tatgtggcgc 

actattctca 

gcatgacagt 

acttacttct 

gggatcatgt 

acgagcgtga 

gcgaactact 

ttgcaggacc 



acaaaattct 

ggaaccttga 

ttgaaaacac 

agtccaatcc 

ggtgtccctt 

atggcagatg 

ctgagggata 

gtggaagagt 

atgtggacta 

cccaatcatt 

tcccttggca 

atgaaaaatg 

tcgcaggtga 

gaattattcc 

ttctggaact 

acaaggaata 

gagtcttcca 

accgagccac 

gcttccatga 

aagtggaagg 

aaggcaaata 

caaagctcac 

ggaaccttac 

taaggtaaat 

tgtattttag 

tgaggaaaac 

ctctcaacat 

ttcagaattg 

tgctatttac 

ttctgtaacc 

tccacacagg 

ctttttaatt 

tcataatcag 

tccccctgaa 

cttataatgg 

cactgcattc 

ccggggggta 

gaacatcatg 

acgtcgtgac 

tttcgccagc 

cagcctgaat 

atcagcttcg 

tacgcatctg 

tgccgcatag 

ttgtctgctc 

tcagaggttt 

ttctagctac 

cctcgtgata 

aggtggcact 

ttcaaatatg 

aaggaagagt 

ttgccttcct 

gttgggtgca 

ttttcgcccc 

ggtattatcc 

gaatgacttg 

aagagaatta 

gacaacgatc 

aactcgcctt 

caccacgatg 

tactctagct 

acttctgcgc 



gaatcggcca 

gagagaatgt 

tgaaagaaca 

atgtttaaat 

tggatttgaa 

cgagcagttt 

tcgacttgca 

ttctgtttca 

tgtaaattct 

taatgacttc 

ggttgttttg 

gattgtaact 

acataatatt 

tcaccacaac 

ggacgaaccc 

cacgaacatc 

caaagggaga 

atgtcttcga 

aggaggtaga 

gaccagtttc 

tggaatatat 

ttaatgggat 

ttctgtggtg 

ataaaatttt 

attccaacct 

ctgttttgct 

tctactcctc 

ctaagttttt 

accacaaagg 

tttataagta 

catagagtgt 

tgtaaagggg 

ccataccaca 

cctgaaacat 

ttacaaataa 

tagttgtggt 

ccagcttcgt 

ttctggtacc 

tgggaaaacc 

tggcgtaata 

ggcgaatggc 

tagctagaac 

tgcggtattt 

ttaagccagc 

ccggcatccg 

tcaccgtcat 

gaagctgata 

cgcctatttt 

tttcggggaa 

tatccgctca 

atgagtattc 

gtttttgctc 

cgagtgggtt 

gaagaacgtt 

cgtattgacg 

gttgagtact 

tgcagtgctg 

ggaggaccga 

gatcgttggg 

cctgtagcaa 

tcecggcaac 

tcggcccttc 



aagaggtata 

atggaagaaa 

actgaatttt 

ggcggcagtt 

ggaaagaact 

tgtaaaaata 

gaaaaccaga 

caaacttcta 

actgaagctg 

actcgggttg 

aatggtaaag 

gctgcccact 

gaggagacag 

tacaatgcag 

ttagtgctaa 

ttcctcaaat 

tcagctttag 

tctacaaagt 

gattcatgtc 

ttaactggaa 

accaaggtat 

cggtcgagcg 

tgacataatt 

taagtgtata 

atggaactga 

cagaagaaat 

caaaaaagaa 

•tgagt'catgc 

aaaaagctgc 

ggcataacag 

ctgctattaa 

ttaataagga 

tttgtagagg 

aaaatgaatg 

agcaatagca 

ttgtccaaac 

agctagaaca 

cccctctaga 

ctggcgttac 

gcgaagaggc 

ggggtaccag 

atcatgttct 

cacaccgcat 

cccgacaccc 

cttacagaca 

caccgaaacg 

tcccagaaca 

tataggttaa 

atgtgcgcgg 

tgagacaata 

aacatttccg 

acccagaaac 

acatcgaact 

ttccaatgat 

ccgggcaaga 

caccagtcac 

ccataaccat 

aggagctaac 

aaccggagct 

tggcaacaac 

aattaataga 

cggctggctg 



attcaggtaa 

agtgtagttt 

ggaagcagta 

gcaaggatga 

gtgaattaga 

gtgctgataa 

agtcctgtga 

agctcacccg 

aaaccatttt 

ttggtggaga 

ttgatgcatt 

gtgttgaaac 

aacatacaga 

ctattaataa 

acagctacgt 

ttggatctgg 

ttcttcagta 

tcaccatcta 

aaggagatag 

ttattagctg 

cccggtatgt 

gccgcgactc 

ggacaaacta 

atgtgttaaa 

tgaatgggag 

gccatctagt 

gagaaaggta 

tgtgtttagt 

actgctatac 

ttataatcat 

taactatgct 

atatttgatg 

ttttacttgc 

caattgttgt 

tcacaaattt 

tcatcaatgt 

tcatgttctg 

gcgaattaat 

ccaacttaat 

ccgcaccgat 

cttcgtagct 

ggtaccccgc 

atggtgcact 

gccaacaccc 

agctgtgacc 

cgcgagacga 

tgatgttcta 

tgtcatgata 

aacccctatt 

accctgataa 

tgtcgccctt 

gctggtgaaa 

ggatctcaac 

gagcactttt 

gcaactcggt 

agaaaagcat 

gagtgataac 

cgcttttttg 

gaatgaagcc 

gttgcgcaaa 

ctggat ggag 

gtttattgct 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

354 0 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 
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gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat 4 560 

ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa 4 620 

cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac 4 680 

caagtttact catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc 4740 

taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc 4800 

cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg 4860 

cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg 4 920 

gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca 4 980 

aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg 504 0 

cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg 5100 

tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga 5160 

acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac 5220 

ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat 5280 

ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc 5340 

tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga 54 00 

tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 54 60 

ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg 5520 

gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 5580 

cgcagcgagt cagtgagcga ggggtaccag aacatgatgt tctagctacg aagctgatat 5640 

cccagaacat gatgttctag ctacgaagct ggtaccccag cggaagagcg cccaatacgc 5700 

aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc 5760 

gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca 5820 
ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcgqataa 



5880 



caatttcaca caggaaacag ctatgaccat gattacgcca agctctctag agctctagag 5940 
ctctagagct ctagagagct tgcatgccgg ggtaccagct tcgtagctag aacatcatgt 6000 
tctgggatat cagcttcgta gctagaacat catgttctgg taccccggtc ga 6052 



<210> 15 
30 <211> 4344 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector pTGFG67 
<400> 15 

cgcgttgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc 60 

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac 120 

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa 180 

tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc cacttggcag 240 

tacatcaagt gtatcatatg . ccaagtacgc cccctattga cgtcaatgac ggtaaatggc 300 

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct 360 

acgtattagt catcgctatt accatggtga tgcggttttg gcagtacatc aatgggcgtg 4 20 

gatagcggtt tgactcacgg ggatttccaa gtctccaccc cattgacgtc aatgggagtt 4 80 

tgttttggca ccaaaatcaa cgggactttc caaaatgtcg taacaactcc gccccattga 540 

cgcaaatggg cggtaggcgt gtacggtggg aggtctatat aagcagagct ctctggctaa 600 

ctagagaacc cactgcttac tggcttatcg aaattaatac gactcactat agggagaccc 660 

aagcttgacc tcgagcaagc ggccgcgact ctactagagg atctttgtga aggaacctta 720 

cttctgtggt gtgacataat tggacaaact acctacagag atttaaagct ctaaggtaaa 780 

tataaaattt ttaagtgtat aatgtgttaa actactgatt ctaattgttt gtgtatttta 840 

gattccaacc tatggaactg atgaatggga gcagtggtgg aatgccttta atgaggaaaa 900 

cctgttttgc tcagaagaaa tgccatctag tgatgatgag gctactgctg actctcaaca 960 

ttctactcct ccaaaaaaga agagaaaggt agaagacccc aaggactttc cttcagaatt 1020 

gctaagtttt ttgagtcatg ctgtgtttag taatagaact cttgcttgct ttgctattta 1080 

caccacaaag gaaaaagctg cactgctata caagaaaatt atggaaaaat attctgtaac 1140 

ctttataagt aggcataaca gttataatca taacatactg ttttttctta ctccacacag 1200 

gcatagagtg tctgctatta ataactatgc tcaaaaattg tgtaccttta gctttttaat 1260 

ttgtaaaggg gttaataagg aatatttgat gtatagtgcc ttgactagag atcataatca 1320 

gcc , a , taccac atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga 1380 

acctgaaaca taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg 1440 

gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt 1500 
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ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca tgtctggatc cccaaatacc isfin 
ctctagagcg aattaattca ctggccgtcg ttttacaacg tcgtgac^gg gaala?cctg lltn 
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgta Sa itln 
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatqgc qaataacacc ll*n 
5 tgatgcggta ttttctcctt acgcatctgt gcggtatttc aca C gcaL ?ggtgcactc JsoS 
tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacLccg ccaacacccq Js60 
ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtqaccg Uto 
tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc qcqaqlcaal 1980 
agggggggta ccagcttcgt agctagaaca tcatgttctg ggaLtcagc ?tcq?aqcta 2040 
10 gaacatcatg ttctggtacc cccctcgtga tacgcctatt ?ttataggtt aatcKataa llio 
""" agaC9 tc »Wt OT ca cttttcgggg aaatgtjcgc ggalccSa l\lo 
tttgtttatt tttctaaata cattcaaata tgtatccgct catgaqacaa taaccctaat ???n 

IttllTaTr tln tatt9a gtatga *^ tcaacatttc cjtgtcgccc 2280 

ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctqqtqa 2340 
acaoco^r tg ^ tga ^ at "gttgggtg cacgagtggg ttacatcgaa ctgga?c^ca 2400 
trtlt^ftt ^tccttgag agttttcgcc ccgaagaacg ttttccaatg atJZgcactt 2460 
ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa qaqcaactca 2520 
gtcgccgcat acactattct cagaatgact tggttgagL c?caccagtc aSagaaaag? 25sS 
atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgaqtqa?a 2640 

20 acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accg?t??tt 2100 
tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggaq ctqlatqaaq 2760^ 
ccataccaaa cgacgagcgt gacaccacga tgcctgtagc altggcalca acgttgcgcl Uto 
aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactqqa?qq 2880 

25 c?^?f taa agttgcagga ccacttctgc gctcggccct tccggctggc ?ggtt?a^g 2940 

25 ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagla c?ggggccaq 3000 
a cc ctcccgtatc gtagttatct acacgacggg gagtcaggca acSt?gSJ| 3^60 
aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactqtcaq 3120 
accaagttta ctcatatata ctttagattg atttaaaact tcatttttH -ttSaaagga 3180 

. tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcg? 3240 

tTcTcJllt ctoc^^r gtagaaaaga tcaaaggatc ttcttgagat JcttiSSS 33 0 S 
tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttqtttqc 3360 
cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagSgl gcgcIgaJlc 3420 
aac-rltTf tC " Ctagtg tagccgtagt taggccacca cttcaagalc L^gtlgcac 3480 
cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt qqcqataaot 3540 

35 cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag JggKgggct 3600 
gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacl galctglgat 3660 
atccaat!^ tgagctat ? a gaaagcgcca cgcttcccga agggagaaag gcggacaggt 3720 
atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg 3780 

40 ^^ g9 ^ at f t ttata 9 tcct gtcgggtttc gccacctctg acttgagcgt cga??t??g? 3840 

40 gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcgglc t?tttacqqt 3900 
tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcg€tl?cc cctqa???ta 3960 
a g aloata CCg tatt »«* cc "tgagtgag ctgataccgc tcgccgcagc cgalcgaccg Mil 
gCgC&g< tT 9 tca gtgagc gaggaagcgg aagagcgccc aatacgcaaa clgcc£ctcc 4080 

45 tllatafnr. 9 gCcgattcat taa tgcagct ggcacgacag gtttcccgac tggaaagcgg 4140 
f" g ^ gagCg caa =gcaatt aatgtgagtt agctcactca ttaggcaccc caggctttal 4200 
tZttt 9 ? ! ccggctcgt atgttgtgtg gaattgtgag cggataacaa tt?2acacag 4260 

sssss :sssss g a cr aagc tctctagagc tctagagctc tagagctct * ».? 



50 



55 



60 



4344 

<210> 16 
<211> 4496 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector pTGFG82 
<400> 16 

cgcgttgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattaqttc 60 
atagcccata tatggagttc cgcgttacat aacttacggt aaatggcclg cctggctq c 120 
cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtSacgccaa ^80 
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tagggacttt 

tacatcaagt 

ccgcctggca 

acgtattagt 

gatagcggtt 

tgttttggca 

cgcaaatggg 

ctagagaacc 

aagcttgacc 

cttctgtggt 

tataaaattt 

gatt ccaacc 

cctgttttgc 

ttctactcct 

gctaagtttt 

caccacaaag 

ctttataagt 

gcatagagtg 

ttgtaaaggg 

gccataccac 

acctgaaaca 

gttacaaata 

ctagttgtgg 

accagcttcg 

gttctggtac 

ctgggaaaac 

ctggcgtaat 

tggcgaatgg 

catatggtgc 

cccgccaaca 

acaagctgtg 

acgcgcgaga 

ccagaacatg 

ataggttaat 

tgtgcgcgga 

gagacaataa 

acatttccgt 

cccagaaacg 

catcgaactg 

tccaatgatg 

cgggcaagag 

accagtcaca 

cataaccatg 

ggagctaacc 

accggagctg 

ggcaacaacg 

attaatagac 

ggctggctgg 

tgcagcactg 

tcaggcaact 

gcattggtaa 

tttttaattt 

ttaacgtgag 

ttgagatcct 

agcggtggtt 

cagcagagcg 

caagaactct 

tgccagtggc 

ggcgcagcgg 

ctacaccgaa 

gagaaaggcg 

gcttccaggg 



ccattgacgt 

gtatcatatg 

tta tgcccag 

catcgctatt 

tgactcacgg 

ccaaaatcaa 

cggtaggcgt 

cactgcttac 

tcgagcaagc 

gtgacataat 

ttaagtgtat 

tatggaactg 

tcagaagaaa 

ccaaaaaaga 

ttgagtcatg 

gaaaaagctg 

aggcataaca 

tctgctatta 

gttaataagg 

atttgtagag 

taaaatgaat 

aagcaatagc 

tttgtccaaa 

tagctagaac 

ccccctctag 

cctggcgtta 

agcgaagagg 

cgcctgatgc 

actctcagta 

cccgctgacg 

accgtctccg 

cgaaagggcg 

atgttctagc 

gtcatgataa 

acccctattt 

ccctgataaa 

gtcgccctta 

ctggtgaaag 

gatctcaaca 

agcactttta 

caactcggtc 

gaaaagcatc 

agtgataaca 

gcttttttgc 

aatgaagcca 

ttgcgcaaac 

tggatggagg 

tttattgctg 

gggccagatg 

atggatgaac 

ctgtcagacc 

aaaaggatct 

ttttcgttcc 

ttttttctgc 

tgtttgccgg 

cagataccaa 

gtagcaccgc 

gataagtcgt 

tcgggctgaa 

ctgagatacc 

gacaggtatc 

ggaaacgcct 



caatgggtgg 

ccaagtacgc 

tacatgacct 

accatggtga 

ggatttccaa 

cgggactttc 

gtacggtggg 

tggcttatcg 

ggccgcgact 

tggacaaact 

aatgtgttaa 

atgaatggga 

tgccatctag 

agagaaaggt 

ctgtgtttag 

cactgctata 

gttataatca 

ataactatgc 

aatatttgat 

gttttacttg 

gcaattgttg 

atcacaaatt 

ctcatcaatg 

atcatgttct 

agcgaattaa 

cccaacttaa 

cccgcaccga 

ggtattttct 

caatctgctc 

cgccctgacg 

ggagctgcat 

gggtaccaga 

tacgaagctg 

taatggtttc 

gtttattttt 

tgcttcaata 

ttcccttttt 

taaaagatgc 

gcggtaagat 

aagttctgct 

gccgcataca 

ttacggatgg 

ctgcggccaa 

acaacatggg 

taccaaacga 

tattaactgg 

cggataaagt 

ataaat ctgg 

gtaagccctc 

gaaatagaca 

aagtttactc 

aggtgaagat 

actgagcgtc 

gcgtaatctg 

atcaagagct 

atactgtcct 

ctacatacct 

gtcttaccgg 

cggggggttc 

tacagcgtga 

cggtaagcgg 

ggtatcttta 



agtatttacg 

cccctattga 

tatgggactt 

tgcggttttg 

gtctccaccc 

caaaatgtcg 

aggtctatat 

aaattaatac 

ctactagagg 

acctacagag 

actactgatt 

gcagtggtgg 

tgatgatgag 

agaagacccc 

taatagaact 

caagaaaatt 

taacatactg 

tcaaaaattg 

gtatagtgcc 

ctttaaaaaa 

ttgttaactt 

tcacaaataa 

tatcttatca 

gggatatcag 

ttcactggcc 

tcgccttgca 

tcgcccttcc 

ccttacgcat 

tgatgccgca 

ggcttgtctg 

gtgtcagagg 

acatgatgtt 

gtaccccggc 

ttagacgtca 

ctaaatacat 

atattgaaaa 

tgcggcattt 

tgaagatcag 

ccttgagagt 

atgtggcgcg 

ctattctcag 

catgacagta 

cttacttctg 

ggatcatgta 

cgagcgtgac 

cgaactactt 

tgcaggacca 

agccggtgag 

ccgtatcgta 

gatcgctgag 

atatatactt 

cctttttgat 

agaccccgta 

ctgcttgcaa 

accaactctt 

tctagtgtag 

cgctctgcta 

gttggactca 

gtgcacacag 

gctatgagaa 

cagggtcgga 

tagtcctgtc 



gtaaactgcc 

cgtcaatgac 

tcctacttgg 

gcagtacatc 

cattgacgtc 

taacaactcc 

aagcagagct 

gactcactat 

atctttgtga 

atttaaagct 

ctaattgttt 

aatgccttta 

gctactgctg 

aaggactttc 

cttgcttgct 

atggaaaaat 

ttttttctta 

tgtaccttta 

ttgactagag 

cct cccacac 

gtttattgca 

agcatttttt 

tgtctggatc 

cttcgtagct 

gtcgttttac 

gcacatcccc 

caacagttgc 

ctgtgcggta 

tagttaagcc 

ctcccggcat 

ttttcaccgt 

ctagctacga 

ctcgtgatac 

ggtggcactt 

tcaaatatgt 

aggaagagta 

tgccttcctg 

ttgggtgcac 

tttcgccccg 

gtattatccc 

aatgacttgg 

agagaattat 

acaacgatcg 

actcgccttg 

accacgatgc 

actctagctt 

cttctgcgct 

cgtgggtctc 

gttatctaca 

ataggtgcct 

tagattgatt 

aatctcatga 

gaaaagatca 

acaaaaaaac 

tttccgaagg 

ccgtagttag 

atcctgttac 

agacgatagt 

cccagcttgg 

agcgccacgc 

acaggagagc 

gggtttcgcc 



cacttggcag 

ggtaaatggc 

cagtacatct 

aatgggcgtg 

aatgggagtt 

gccccattga 

ctctggctaa 

agggagaccc 

aggaacctta 

ctaaggtaaa 

gtgtatttta 

atgaggaaaa 

actctcaaca 

cttcagaatt 

ttgctattta 

attctgtaac 

ctccacacag 

gctttttaat 

atcataatca 

ctccccctga 

gcttataatg 

tcactgcatt 

cccggggggt 

agaacatcat 

aacgtcgtga 

ctttcgccag 

gcagcctgaa 

tttcacaccg 

agccccgaca 

ccgcttacag 

catcaccgaa 

agctgatatc 

gcctattttt 

ttcggggaaa 

at ccgctcat 

tgagtattca 

tttttgctca 

gagtgggtta 

aagaacgttt 

gtattgacgc 

ttgagtactc 

gcagtgctgc 

gaggaccgaa 

atcgttggga 

ctgtagcaat 

cccggcaaca 

cggcccttcc 

gcggtatcat 

cgacggggag 

cactgattaa 

taaaacttca 

ccaaaatccc 

aaggatcttc 

caccgctacc 

taactggctt 

gccaccactt 

cagtggctgc 

taccggataa 

agcgaacgac 

ttcccgaagg 

gcacgaggga 

acctctgact 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

18 60 

1920 

1960 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 
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tgagcgtcga 
cgcggccttt 
gttatcccct 
ccgcagccga 
acgcaaaccg 
tcccgactgg 
ggcaccccag 
ataacaattt 
agagctctag 
atgttctggg 



tttttgtgat 
ttacggttcc 
gattctgtgg 
acgaccgagc 
cctctccccg 
aaagcgggca 
gctttacact 
cacacaggaa 
agctctagag 
atatcagctt 



gctcgtcagg 
tggccttttg 
ataaccgtat 
gcagcgagtc 
cgcgttggcc 
gtgagcgcaa 
ttatgcttcc 
acagctatga 
agcttgcatg 
cgtagctaga 



ggggcggagc 
ctggcctttt 
taccgccttt 
agtgagcgag 
gattcattaa 
cgcaattaat 
ggctcgtatg 
ccatgattac 
ccggggtacc 
acatcatgtt 



ct atggaaaa 
get cacatgt 
gagtgagctg 
gaagcggaag 
tgcagctggc 
gtgagttagc 
ttgtgtggaa 
gccaagctct 
agettegtag 
ctggtacccc 



acgccagcaa 
tctttcctgc 
ataccgctcg 
agcgcccaat 
acgacaggtt 
tcactcatta 
ttgtgagcgg 
ctagagctct 
ctagaacatc 
ggtcga 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4496 
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<210> 17 
<211> 4644 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector pTGFG95 



<400> 17 

cgcgttgaca 

atageccata 

cgcccaacga 

tagggacttt 

tacatcaagt 

ccgcctggca 

aegtattagt 

gatageggtt 

tgttttggca 

cgcaaatggg 

ctagagaacc 

aagcttgacc 

cttctgtggt 

tataaaattt 

gattccaacc 

cctgttttgc 

ttctactcct 

gctaagtttt 

caccacaaag 

ctttataagt 

gcatagagtg 

ttgtaaaggg 

gccataccac 

acctgaaaca 

gttacaaata 

ctagttgtgg 

accagcttcg 

gttctggtac 

ctgggaaaac 

ctggcgtaat 

tggcgaatgg 

gtagctagaa 

gtgeggtatt 

gt taagccag 

cccggcatcc 

ttcaccgtca 

acgaagctga 

tacgectatt 

ettttegggg 

tgtatccget 

gtatgagtat 



ttgattattg 

tatggagttc 

cccccgccca 

ecattgaegt 

gtatcatatg 

ttatgcccag 

categctatt . 

tgactcaegg 

ccaaaatcaa 

eggtaggegt 

cactgcttac 

tcgagcaagc 

gtgacataat 

ttaagtgtat 

tatggaactg 

tcagaagaaa 

ccaaaaaaga 

ttgagtcatg 

gaaaaagctg 

aggcataaca 

tetgetatta 

gttaataagg 

atttgtagag 

taaaatgaat 

aagcaatagc 

tttgtccaaa 

tagctagaac 

ccccctctag 

cctggcgtta 

agegaagagg 

eggggtacca 

catcatgttc 
tcacaccgca 
ccccgacacc 
gcttacagac 
tcaccgaaac 
tatcccagaa 
tttataggtt 
aaatgtgcgc 
catgagacaa 
tcaacatttc 



actagttatt 

cgcgttacat 

ttgacgtcaa 

caatgggtgg 

ccaagtacgc 

tacatgacct 

accatggtga 

ggatttccaa 

egggacttte 

gtacggtggg 

tggcttatcg 

ggccgcgact 

tggacaaact 

aatgtgttaa 

atgaatggga 

tgecatctag 

agagaaaggt 

ctgtgtttag 

cactgetata 

gttataatca 

ataactatgc 

aatatttgat 

gttttacttg 

gcaattgttg 

atcacaaatt 

ctcatcaatg 

atcatgttct 

agegaattaa 

cccaacttaa 

cccgcaccga 

gettegtage 

tggtaccccg 

tatggtgcac 

cgccaacacc 

aagctgtgac 

gcgcgagacg 

catgatgttc 

aatgtcatga 

ggaaccccta 

taaccctgat 

cgtgtcgccc 



aatagtaatc 

aacttaeggt 

taatgacgta 

agtatttacg 

cccctattga 

tatgggactt 

tgeggttttg 

gtctccaccc 

caaaatgtcg 

aggtctatat 

aaattaatac 

ctactagagg 

acctacagag 

actactgatt 

gcagtggtgg 

tgatgatgag 

agaagacccc 

taatagaact 

caagaaaatt 

taacatactg 

tcaaaaattg 

gtatagtgcc 

ctttaaaaaa 

ttgttaactt 

tcacaaataa 

tatcttatca 

gggatatcag 

ttcactggcc 

tcgccttgca 

tcgcccttcc 

tagaacatca 

ectgatgegg 

tctcagtaca 

cgctgacgcg 

cgtctccggg 

aaagggctac 

tagctacgaa 

taataatggt 

tttgtttatt 

aaatgettea 

ttattccctt 



aattacgggg 

aaatggcccg 

tgttcccata 

gtaaactgee 

egtcaatgae 

■tcct-acttgg 

gcagtacatc 

cattgaegtc 

taacaactc.c 

aagcagagct 

gactcactat 

atctttgtga 

atttaaagct 

ctaattgttt 

aatgecttta 

getactgetg 

aaggactttc 

ettgettget 

atggaaaaat 

ttttttctta 

tgtaccttta 

ttgactagag 

cctcccacac 

gtttattgea 

agcatttttt 

tgtctggatc 

ettegtaget 

gtcgttttac 

gcacatcccc 

caacagttgc 

tgttctggga 

tattttctcc 

atetgetctg 

ccctgacggg 

agetgeatgt 

cagaacatga 

gctggtaccc 

ttcttagacg 

tttctaaata 

ataatattga 

ttttgeggea 



tcattagttc 
cctggctgac 
gtaacgccaa 
cacttggcag 
ggtaaatggc 
cagtacatct 
aatgggcgtg 
aatgggagtt 
gccccattga 
ctctggctaa 
agggagaccc 
aggaacctta 
ctaaggtaaa 
gtgtatttta 
atgaggaaaa 
actctcaaca 
cttcagaatt 
ttgetattta 
attctgtaac 
ctccacacag 
gctttttaat 
atcataatca 
ctccccctga 
gcttataatg 
teactgeatt 
cccggggggt 
agaacatcat 
aacgtcgtga 
ctttcgccag 
geagectgaa 
tatcagcttc 
ttaegcatet 
atgeegcata 
ettgtctget 
gtcagaggtt 
tgttctagct 
cgcctcgtga 
tcaggtggca 
cattcaaata 
aaaaggaaga 
ttttgectte 



60 

120 

180 

240 

300 

360 . 

420 

480 

54.0 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
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ctgtttttgc 
cacgagtggg 
ccgaagaacg 
cccgtattga 
tggttgagta 
tatgcagtgc 
tcggaggacc 
ttgatcgttg 
tgcctgtagc 
cttcccggca 
.gctcggccct 
ctcgcggtat 
acacgacggg 
cctcactgat 
atttaaaact 
tgaccaaaat 
tcaaaggatc 
aaccaccgct 
aggtaactgg 
taggccacca 
taccagtggc 
agttaccgga 
tggagcgaac 
cgcttcccga 
agcgcacgag 
gccacctctg 
aaaacgccag 
tgttctttcc 
ctgataccgc 
agaacatgat 
ctggtacccc 
ttcattaatg 
caattaatgt 
ctcgtatgtt 
atgattacgc 
ggggtaccag 
atcatgttct 



tcacccagaa 
ttacatcgaa 
ttttccaatg 
cgccgggcaa 
ctcaccagtc 
tgccataacc 
gaaggagcta 
ggaaccggag 
aatggcaaca 
acaattaata 
tccggctggc 
cattgcagca 
gagtcaggca 
taagcattgg 
tcatttttaa 
cccttaacgt 
ttcttgagat 
accagcggtg 
cttcagcaga 
cttcaagaac 
tgctgccagt 
taaggcgcag 
gacctacacc 
agggagaaag 
ggagcttcca 
acttgagcgt 
caacgcggcc 
tgcgttatcc 
tcgccgcagc 
gttctagcta 
agcggaagag 
cagctggcac 
gagttagctc 
gtgtggaatt 
caagctctct 
cttcgtagct 
ggtaccccgg 



acgctggtga 
ctggatctca 
atgagcactt 
gagcaactcg 
acagaaaagc 
atgagtgata 
accgcttttt 
ctgaatgaag 
acgttgcgca 
gactggatgg 
tggtttattg 
ctggggccag 
actatggatg 
taactgtcag 
tttaaaagga 
gagttttcgt 
cctttttttc 
gtttgtttgc 
gcgcagatac 
tctgtagcac 
ggcgataagt 
cggtcgggct 
gaactgagat 
gcggacaggt 

gggggaaacg 

cgatttttgt 
tttttacggt 
cctgattctg 
cgaacgaccg 
cgaagctgat 
cgcccaatac 
gacaggtttc 
actcattagg 
gtgagcggat 
agagctctag 
agaacatcat 
tcga 



aagtaaaaga 
acagcggtaa 
ttaaagttct 
gtcgccgcat 
atcttacgga 
acactgcggc 
tgcacaacat 
ccataccaaa 
aactattaac 
aggcggataa 
ctgataaatc 
atggtaagcc 
aacgaaatag 
accaagttta 
tctaggtgaa 
tccactgagc 
tgcgcgtaat 
cggatcaaga 
caaatactgt 
cgcctacata 
cgtgtcttac 
gaacgggggg 
acctacagcg 
atccggtaag 
cctggtatct 
gatgctcgtc 
tcctggcctt 
tggataaccg 
agcgcagcga 
atcccagaac 
gcaaaccgcc 
ccgactggaa 
caccccaggc 
aacaatttca 
agctctagag 
gttctgggat 



tgctgaagat 
gatccttgag 
gctatgtggc 
acactattct 
tggcatgaca 
caacttactt 

gggggatcat 

cgacgagcgt 
tggcgaacta 
agttgcagga 
tggagccggt 
ctcccgtatc 
acagatcgct 
ctcatatata 
gatccttttt 
gtcagacccc 
ctgctgcttg 
gctaccaact 
ccttctagtg 
cctcgctctg 
cgggttggac 
ttcgtgcaca 
tgagctatga 
cggcagggtc 
ttatagtcct 
aggggggcgg 
ttgctggcct 
tattaccgcc 
gtcagtgagc 
atgatgttct 
tctccccgcg 
agcgggcagt 
tttacacttt 
cacaggaaac 
ctctagagag 
atcagcttcg 



cagttgggtg 
agttttcgcc 
gcggtattat 
cagaatgact 
gtaagagaat 
ctgacaacga 
gtaactcgcc 
gacaccacga 
cttactctag 
ccacttctgc 
gagcgtgggt 
gtagttatct 
gagataggtg 
ctttagattg 
gataatctca 
gtagaaaaga 
caaacaaaaa 
ctttttccga 
tagccgtagt 
ctaatcctgt 
tcaagacgat 
cagcccagct 
gaaagcgcca 
ggaacaggag 
gtcgggtttc 
agcctatgga 
tttgctcaca 
tttgagtgag 
gaggggtacc 
agctacgaag 
cgttggccga 
gagcgcaacg 
atgcttccgg 
agctatgacc 
cttgcatgcc 
tagctagaac 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
414 0 
4200 
4 2-60 
4320 
4380 
4440 
4500 
4560 
4620 
4644 
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45 



50 



55 



60 



<210> 18 
<211> 933 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Thr Glu Leu Lys Ala Lys Gly Pro Arg Ala Pro His Val Ala Gly 



10 



15 



Gly Pro Pro Ser Pro Glu Val Gly Ser Pro Leu Leu Cys Arg Pro Ala 

20 25 30 

Ala Gly Pro Phe Pro Gly Ser Gin Thr Ser Asp Thr Leu Pro Glu VaJ 

35 40 " 45 

Ser Ala lie Pro lie Ser Leu Asp Gly Leu Leu Phe Pro Arq Pro Cvs 

50 55 60 - 

Gin Gly Gin Asp Pro Ser Asp Glu Lys Thr Gin Asp Gin Gin Ser Leu 

65 70 75 * 80 

Ser Asp Val Glu Gly Ala Tyr Ser Arg Ala Glu Ala Thr Arg Gly Ala 

85 90 95 
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17 



Gly Gly Ser Ser Ser Ser Pro Pro Glu Lys Asp Ser Gly Leu Leu Asp 

100 105 110 

Ser Val Leu Asp Thr Leu Leu Ala Pro Ser Gly Pro Gly Gin Ser Gin 

115 120 125 

Pro Ser Pro Pro Ala Cys Glu Val Thr Ser Ser Trp Cys Leu Phe Gly 

130 " 135 140 

Pro Glu Leu Pro Glu Asp Pro Pro Ala Ala Pro Ala Thr Gin Arg Val 

145 150 155 160 



Leu Ser Pro Leu Met Ser Arg Ser Gly Cys Lys Val Gly Asp Ser Ser 

15 165 170 ~ 175 

Gly Thr Ala Ala Ala His Lys Val Leu Pro Arg Gly Leu Ser Pro Ala 

180 185 190 

20 Arg Gin Leu Leu Leu Pro Ala Ser Glu Ser Pro His Trp Ser Gly Ala 
195 200 205 



Pro Val Lys Pro Ser Pro Gin Ala Ala Ala Val Glu Val Glu Glu Glu 

210 215 220 

Asp Gly Ser Glu Ser Glu Glu Ser Ala Gly Pro Leu Leu Lys Gly Lys 

225 230 235 240 



Pro Arg Ala Leu Gly Gly Ala Ala Ala Gly Gly Gly Ala Ala Ala Val 

30 245 250 255 

Pro Pro Gly Ala Ala Ala Gly Gly Val Ala Leu Val Pro Lys Glu Asp 
260 265 270 

35 Ser Arg Phe Ser Ala Pro Arg Val Ala Leu Val Glu Gin Asp Ala Pro 
275 280 285 



Met Ala Pro Gly Arg Ser Pro Leu Ala Thr Thr Val Met Asp Phe lie 

290 " 295 300 

His Val Pro lie Leu Pro Leu Asn His Ala Leu Leu Ala Ala Arg Thr 

305 310 315 320 



Arg Gin Leu Leu Glu Asp Glu Ser Tyr Asp Gly Gly Ala Gly Ala Ala 
45 325 .330 335 

Ser Ala Phe Ala Pro Pro Arg Ser Ser Pro Cys Ala Ser Ser Thr Pro 
340 " 345 350 

50 Val Ala Val Gly Asp Phe Pro Asp Cys Ala Tyr Pro Pro Asp Ala Glu 

355 ' 360 365 ■ 



Pro Lys Asp Asp Ala Tyr Pro Leu Tyr Ser Asp Phe Gin Pro Pro Ala 

370 375 380 

Leu Lys lie Lys Glu Glu Glu Glu Gly Ala Glu Ala Ser Ala Arg Ser 

385 390 395 400 



Pro Arg Ser Tyr Leu Val Ala Gly Ala Asn Pro Ala Ala Phe Pro Asp 
60 405 410 415 
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25 



Phe Pro Leu Gly Pro Pro Pro Pro Leu Pro Pro Arg Ala Thr Pro Ser 
420 425 430 

Arg Pro Gly Glu Ala Ala Val Thr Ala Ala Pro Ala Ser Ala Ser val 
4 35 440 445 

Ser ffn Ala Ser SSr Ser Gly Ser Thr Leu Glu c ys He Leu Tyr Lys 
450 455 4 60 

Ala Glu Gly Ala Pro Pro Gin Gin Gly Pro Phe Ala Pro Pro Pro Cys 
465 470 475 4 | 0 

Lys Ala Pro Gly Ala Ser Gly Cys Leu Leu Pro Arg Asp Gly Leu Pro 
15 485 490 " 4 95 

Ser Thr Ser Ala Ser Ala Ala Ala Ala Gly Ala Ala Pro Ala Leu Tvr 
500 505 510 

20 Pro Ala Leu Gly Leu Asn Gly Leu Pro Gin Leu Gly Tyr Gin Ala Ala 
515 520 525 

Val Leu Lys Glu Gly Leu Pro Gin Val Tyr Pro Pro Tyr Leu Asn Tyr 
530 535 54 0 

Leu Arg Pro Asp Ser Glu Ala Ser Gin Ser Pro Gin Tyr Ser Phe Glu 
545 550 555 560 

Ser Leu Pro Gin Lys lie Cys Leu He Cys Gly Asp Glu Ala Ser Gly 
30 565 570 575 

Cys His Tyr Gly Val Leu Thr Cys Gly Ser Cys Lys Val Phe Phe Lys 
580 585 590 

35 Arg Ala Met Glu Gly Gin His Asn Tyr Leu Cys Ala Gly Arg Asn Asp 
595 600 605 

Cys lie Val Asp Lys He Arg Arg Lys Asn Cys Pro Ala Cys Arq Leu 
610 615 6 20 

40 

Arg Lys Cys Cys Gin Ala Gly Met Val Leu Gly Gly Arg Lys Phe Lys 
625 630 635 640 

Lys Phe Asn Lys Val Arg Val Val Arg Ala Leu Asp Ala Val Ala Leu 
45 645 650 655 

Pro Gin Pro Leu Gly Val Pro Asn Glu Ser Gin Ala Leu Ser Gin Arq 
660 665 670 

50 Phe Thr Phe Ser Pro Gly Gin Asp He Gin Leu He Pro Pro Leu He 
675 680 685 

Asn Leu Leu Met Ser He Glu Pro Asp Val He Tyr Ala Gly His Asp 
55 690 695 700 

Asn Thr Lys Pro Asp Thr Ser Ser Ser Leu Leu Thr Ser Leu Asn Gin 
705 710 715 720 

Leu Gly Glu Arg Gin Leu Leu Ser Val Val Lys Trp Ser Lys Ser Leu 
OU 725 730 735 
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Pro 
Tyr 
Lys 



Asn 
785 

Met 



Glu 



Glu 



He 



Ser 
865 

His 



Gin 
He 
Leu 



Gly Phe Arg Asn Leu His 
740 

Ser Trp Met Ser Leu Met 
755 



His Val Ser Gly 
770 

Glu Gin Arg Met 



Trp Gin He Pro 
805 

Phe Leu Cys Met 
820 

Gly Leu Arg Ser 
8 35 

Arg Glu Leu He 
850 

Ser Ser Gin Arg 



Asp Leu Val Lys 
885 

Ser Arg Ala Leu 
900 

Ala Ala Gin Leu 
915 

Phe His Lys Lys 
930 



Gin Met 
775 

Lys Glu 
790 

Gin Glu 

Lys Val 

Gin Thr 

Lys Ala 
855 

Phe Tyr 
870 

Gin Leu 
Ser Val 



He Asp Asp Gin 
745 

Val Phe Gly Leu 
760 

Leu Tyr Phe Ala 



Ser Ser Phe Tyr 
795 

Phe Val Lys Leu 
810 

Leu Leu Leu Leu 
825 

Gin Phe Glu Glu 
840 

He Gly Leu Arg 



Gin Leu Thr Lys 
875 

His Leu Tyr Cys 
890 

Glu Phe Pro Glu 
905 



Pro Lys He Leu Ala Gly 
920 



He Thr Leu 
750 

Gly Trp Arg 
765 

Pro Asp Leu 
780 

Ser Leu Cys 

Gin Val Ser 

Asn Thr He 
830 

Met Arg Ser 
545 

Gin Lys Gly 
860 

Leu Leu Asp 
Leu Asn Thr 



Met Met Ser 
910 

Met Val Lys 
925 



He Gin 
Ser Tyr 
He Leu 



Leu Thr 
800 

Gin Glu 
815 

Pro Leu 



Ser Tyr 

Val Val 

Asn Leu 
880 

Phe He 
895 

Glu Val 
Pro Leu 



<210> 19 

<211> 2970 

<212> DNA 

<213> Homo sapiens 



<400> 19 

ctgaccagcg 

cacattcaac 

tcccttttcc 

tgagctgaag 

ggtcggatcc 

ggacaccttg 

gccctgccag 

cgtggagggc 

tcccccagaa 

aggtcccggg 

gtttggcccc 

cccgctcatg 

taaagtgctg 

ccctcactgg 

ggaggaggat 

ggctctgggt 



ccgccctccc 
acccactttc 
ctcctccctg 
gcaaagggtc 
ccactgctgt 
cctgaagttt 
ggacaggacc 
gcatattcca 
aaggacagcg 
cagagccaac 
gaacttcccg 
agccggtccg 
ccccggggcc 
tccggggccc 
ggctctgagt 
ggcgcggcgg 



ccgcccccga 
tcctccctct 
gagacggggg 
cccgggctcc 
gtcgcccagc 
cggccatacc 
cctccgacga 
gagctgaagc 
gactgctgga 
ccagccctcc 
aagatccacc 
ggtgcaaggt 
tgtcaccagc 
cagtgaagcc 
ccgaggagtc 
ctggaggagg 



cccaggaggt 
gcccctatat 
aggagaaaag 
ccacgtggcg 
cgcaggtccg 
tatctccctg 
aaagacgcag 
tacaaggggt 
cagtgtcttg 
cgcctgcgag 
ggctgccccc 
tggagacagc 
ccggcagctg 
gtctccgcag 
tgcgggtccg 
agccgcggct 



ggagatccct 
tcccgaaacc 
gggagtccag 
ggcggcccgc 
ttcccgggga 
gacgggctac 
gaccagcagt 
gctggaggca 
gacactctgt 
gtcaccagct 
gccacccagc 
tccgggacgg 
ctgctcccgg 
gccgctgcgg 
cttctgaagg 
gtcccgccgg 



ccggtccagc 
ccctcctcct 
tcgtcatgac 
cctcccccga 
gccagacctc 
tcttccctcg 
cgctgtcgga 
gcagttctag 
tggcgccctc 
cttggtgcct 
gggtgttgtc 
cagctgccca 
cctctgagag 
tggaggttga 
gcaaacctcg 
gggcggcagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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aggaggcgtc 
ggtggagcag 
tttcatccac 
gctgctggaa 
gcggagttca 
gtacccgccc 
gcccgctcta 
ttcctacctt 
gcccccgctg 
acccgccagt 
-gtacaaagcg 
gccgggcgcg 
cgccgccgcc 
gctcggctac 
caactacctg 
acctcagaag 
.. tacctgtggg 
atgtgctgga 
tcgccttaga 
caataaagtc 
tccaaatgaa 
gttgattcca 
acatgacaac 
cgagaggcaa 
acatattgat 
tctaggatgg 
aatactaaat 
gcagatccca 
agtattgtta 
ggagatgagg 
agttgtgtcg 
tcttgtcaaa 
tgttgaattt 
agggatggtg 



gccctggtcc 

gacgcgccga 

gtgcctatcc 

gacgaaagtt 

ccctgtgcct 

gacgccgagc 

aagataaagg 

gtggccggtg 

ccgccgcgag 

gcctcagtct 

gagggcgcgc 

agcggctgcc 

ggggcggccc 

caggccgccg 

aggccggatt 

atttgtttaa 

agctgtaagg 

agaaatgact 

aagtgctgtc 

agagttgtga 

agccaagccc 

ccactgatca 

acaaaacctg 

cttctttcag 

gaccagataa 

agatcctaca 

gaacagcgga 

caggagtttg 

cttcttaata 

tcaagctaca 

agctcacagc 

caacttcatc 

ccagaaatga 

aaaccccttc 



ccaaggaaga 

tggcgcccgg 

tgcctctcaa 

acgacggcgg 

cgtccacccc 

ccaaggacga 

aggaggagga 

ccaaccccgc 

cgaccccatc 

cgtctgcgtc 

cgccccagca 

tgctcccgcg 

ccgcgctcta 

tgctcaagga 

cagaagccag 

tctgtgggga 

tcttctttaa 

gcatcgttga 

aggctggcat 

gagcactgga 

taagccagag 

acctgttaat 

acacctccag 

tagtcaagtg 

ctctcattca 

aacatgtcag 

tgaaagaatc 

tcaagcttca 

caattccttt 

ttagagagct 

gtttctatca 

tgtactgctt 

tgtctgaagt 

tctttcataa 



ttcccgcttc 

gcgctccccg 

tcacgcctta 

ggccggggct 

ggtcgctgta 

cgcgtaccct 

aggcgcggag 

agccttcccg 

cagacccggg 

ctcctcgggg 

gggcccgttc 

ggacggcctg 

ccctgcactc 

gggcctgccg 

ccagagccca 

tgaagcatca 

gagggcaatg 

taaaatccgc 

ggtccttgga 

tgctgttgct 

attcactttt 

gagcattgaa 

ttctttgctg 

gtctaaatca 

gtattcttgg 

tgggcagatg 

atcattctat 

agttagccaa 
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